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CHIMERIC AUTOPROCESSING POLYPEPTIDES AND USES THEREOF 



FIELD AND BACKGROUND OF THE INVENTION 

The present invention relates to polypeptides having the capacity to display 
5 auto-cleavage, polynucleotides encoding such polypeptides, and uses of such 
polypeptides and polynucleotides for reversibly binding proteins to specific substrates, 
reversibly binding specific substrates to each other, and for splicing amino acid 
sequences. More particularly, the present invention relates to chimeric polypeptides 
capable of auto-cleaving at defined locations, including auto-cleaving resulting in 
10 defined auto-splicing, to polynucleotides suitable for expressing such polypeptides, 
and to methods of using such polypeptides and polynucleotides for protein 
purification, affinity selection of display phages, and post-translational ligation of 
proteins. 

Autoprocessing protein domains, such as inteins and Hogs, have the capacity 
15 to post-translationally auto-cleave or auto-splice flanking polypeptide sequences and 
thereby serve as unique and potent protein engineering tools useful in various 
applications, including protein purification, affinity selection of display phages, 
generation of cytotoxic proteins, segmental modification or labeling of proteins, 
protein or peptide cyclization, and generation of reactive polypeptide termini in 
20 expressed proteins for various biochemical reactions, including protein ligation (Perler 
and Adam, 2000. Curr Opin Biotechnol. 11, 377-83). However, the usefulness of the 
presently available repertoire of autoprocessing polypeptides is hampered by various 
limitations, as described in further detail hereinbelow. 

Inteins are internal protein domains naturally occurring in a variety of host 
25 proteins (Hirata et aL, 1990. J. Biol. Chem. 265, 6726-6733; Kane et aL, 1990. 
Science 250, 651-657; Perler et aL, 1994. Nucl. Acids Res. 22, 1125-1127; Noren et 
aL, 2000. Angew. Chem. Int. Ed. 39, 450). Inteins have been found in organisms 

from all three domains of life, including in yeast and algal chloroplasts (eukaryotes), 

> » 

mycobacteria and cyanobacteria (bacteria), and thermophilic archaea (archaea). So 
30 far, no essential biological role has been shown for inteins, and all of their identified 
functions involve their own preservation and maintenance, with no apparent benefit to 
the host protein and organism (reviewed in Pietrokovski, 2000. Trends in Genetics 17, 
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465-472). At least some inteins are multifunctional, being able to both catalyze their 
own protein splicing and to home a copy of their gene into intein-less alleles (Gimble 
and Thorner, 1992. Nature 357, 301; Chong et aL, 1996. J Biol Chem. 271, 22159). 
Hogs are protein domains found in Hedgehogs which are proteins composed of an 
5 amino terminal Hedge protein domain and a carboxy terminal Hog protein region 
(Aspock G., 1999. Genome Res. 9, 909; Hammerschmidt et aL, 1997. Trends Genet. 
13, 14). Other protein domains, such as various Caenorhabditis elegans carboxy 
terminal domains, are believed to autocatalytically cleave themselves from host 
proteins, thereby modulating the activity of the amino terminal parts (Burglin, 1996. 

10 Curr Biol. 6, 1047; Porter et aL, 1996. Cell 86, 21), similarly to Hogs. 

Members of the intein and Hog protein domain families share the capacity to 
autocatalytically cleave the peptide bond joining them to polypeptides flanking their 
amino terminal ends ("amino terminal cleavage")- Inteins have the further capacity to 
cleave the peptide bond joining them to polypeptides flanking their carboxy terminal 

15 ends ("carboxy terminal cleavage") while splicing polypeptides flanking their amino 
and carboxy terminal ends (termed "exteins"), resulting in self-excision of the intein 
from the host protein, and concomitant ligation of the flanking extein domains with a 
peptide bond. Thus, intein-containing host proteins undergo a switch from an intein- 
containing state to an intein-less state via such a process. Most reported inteins 

20 furthermore also contain an endonuclease domain whose function is to mediate the 
copying of the intein gene into specific unoccupied genomic insertion points, thereby 
enabling intein propagation. 

Both inteins and Hogs share a similar structure fold and contain characteristic 
"Hint" consensus motifs which mediate the biochemical reactions involved in the 

25 autocatalytic activities of these protein domains (Hall TM., 1997. Cell 91, 85; 
Pietrokovski S., 1994. Protein Sci. 3, 2340; Pietrokovski S., 1998. Protein Sci. 7, 64; 
Paulus, 2000. Annu. Rev. Biochem. 69, 447). These Hint motif-mediated biochemical 
reactions are similar in both inteins and Hogs, but are involved in different biological 
processes {Dalgaard et aL, 1997. J Comput Biol. 4, 193; Hall et aL, 1997. Cell 91, 85; 

30 Pietrokovski S., 1998. Protein Sci. 7, 64; Xu and Perler, 1996. EMBO J. 15, 5146). 
The initial biochemical reactions of intein and Hog amino terminal cleavage are 
identical; the peptide bond attaching the amino terminal end of the Hint domain to an 
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amino terminal-flanking sequence is converted into a thioester (or ester) bond, a trans- 
esterification reaction then covalently attaches the sequence flanking the carboxy 
terminal end of the intein, or a cholesterol molecule in the case of Hog proteins, to the 
amino terminal flanking sequence, thereby cleaving the bond attaching the amino 
terminal sequence to the Hint domain. In a process essential for organismal 
development, Hint-mediated autocatalytic excision of the carboxy terminal Hog 
protein domain from the amino terminal Hedge protein domain in Hedgehogs leads to 
covalent attachment of a cholesterol molecule to the carboxy end of the Hedge 
domain, leading to its activation and secretion from the cell (Porter JA. et al. t 1996. 
Cell 86, 21; Porter, JA. et al, 1996. Science.274, 255). In the case of inteins, protein 
splicing is effected sequentially by cleavage of the bond attaching the intein amino 
terminal end to the carboxy terminal extein, ligation of the amino and carboxy 
terminal exteins, and cleavage of the bond attaching the intein carboxy terminal end to 
the carboxy terminal extein. 
1 5 Mechanistic studies have determined the roles of highly conserved residues 

positioned near the intein/extein junctions in the splicing reaction (Chong et al 9 1996. 
J. Biol. Chem. 271, 22159-22168; Xu et al y 1996. EMBO J. 15, 5146-5153; Stoddard 
et aL, 1998. Nat. Struct. Biol. 5, 3). These residues include: the Cys, Ser or Thr 
residue forming the amino terminal end of the intein, which initiates splicing with an 
20 acyl shift; the conserved Cys, Ser or Thr residue flanking the carboxy terminal end of 
the intein, which ligates the exteins through nucleophilic attack; and the conserved 
Asn forming the carboxy terminal end of the intein, which releases the intein from the 
ligated exteins via succinimide formation. The amino terminal acyl shift and the 
carboxy terminal succinimide formation cleavage activities of the intein are separable. 
25 The amino terminal cleavage takes place in two separate steps. In the first step, as 
described above, the peptide bond between the intein and the amino terminal extein is 
converted to a thioester (or ester in some cases). In the second step, the thioester bond 
is cleaved by a nucleophilic attack from the side-chain of the residue flanking the 
carboxy terminal end of the intein, causing a transesterification reaction. 
30 Because the structural information required for splicing exists entirely within 

inteins, and since the process of splicing has no energy requirements (for example 
hydrolysis of ATP), such protein domains can be used in a variety of applications 
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involving intein insertion into foreign contexts. Various methods have been used in 
attempts to control and alter intein-mediated functions. Since endonuclease activity is 
not required for protein splicing, mini-inteins with accurate splicing activity have been 
generated by deletion of this central domain (Derbyshire et al, 1997. Proc. Natl. 
Acad. Sci. USA. 94, 11466; Chong et al, 1997. J. Biol. Chem. 272, 15587; and 
Shingledecker et al, 1998. Gene 207, 187). Also, mutation of residues near the 
intein/extein junctions has been used to alter intein activity, for example, to yield 
isolated cleavage at one or both of the intein-extein junctions (Chong et al, 1998. J. 
Biol. Chem. 273, 10567). 

Thus, the ability to modulate the function of autoprocessing polypeptides such 
as inteins has broad potential application, as described above. In the case of protein 
purification where an autoprocessing polypeptide is used in conjunction with an 
affinity group to purify a desired target protein (Chong et al, 1997. Gene 192, 271- 
281; Chong et al, 1998. Nucl. Acids Res. 26, 5109), purification of a target protein is 
effected by co-expressing the target protein as a fusion protein containing a 
purification tag in one terminal segment, an internal autoprocessing polypeptide, and a 
target protein forming the other terminal segment. Such fusion proteins are exposed 
to affinity purification matrices designed to capture the tagged molecule. The target 
protein is then selectively released from the purification matrix by inducing 
autoprocessing polypeptide-mediated auto-cleavage of the peptide bond attaching the 
target protein to the autoprocessing polypeptide. Such a procedure is advantageous 
since autoprocessing polypeptide cleavage affects the fusion protein only, and thus 
non-specifically bound contaminant proteins are not released into the product stream. 
Furthermore, such a method does not employ contaminating and expensive proteases, 
such as those used in technologies employing protease-mediated cleavage of 
purification-tagged target proteins. The aforementioned strategy forms the basis of 
the protein purification systems such as the commercially available IMPACT-CN 
system (New England Biolabs, Beverly, MA). 

However, prior art methods of using such autoprocessing polypeptides for 
•applied uses have numerous drawbacks. In applied systems such as IMPACT-CN, the 
accessory molecule involved in cleavage of the thioester bond between the intein and 
the extein following amino terminal cleavage must be effected with a strong thiol- 
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containing nucleophile such as 2-mercaptoethanol or dithiothreitol (DTT), both of 
which are strong reducing agents which modify the carboxy terminal end of the 
extein. In such systems, although initial thioester formation is mediated by the intein, 
the actual cleavage of the extein is effected via non-enzymatic chemical cleavage of a 
5 thioester bond by a small nucleophilic molecule, thereby severely limiting the 
maximal reaction rates achievable. While such systems allow carboxy terminal 
cleavage, such cleavage has the drawback of resulting in undesirable amino terminal 
cleavage, thereby requiring the amino terminal fragment to be removed in an 
additional purification step. Furthermore, despite insights into intein structure and 

10 function, modifications often result in unacceptably low activity, poor precursor 
stability, or insolubility (Derbyshire et al, 1997. Proc. Natl. Acad. Sci. USA. 94, 
11466; Chong et al y 1997. Gene 192, 271-281; Shingledecker et al, 1998. Gene 207, 
1 87; Chong et al. 9 1998. Nucl. Acids Res. 26, 5 109). 

Thus, all prior art approaches have failed to provide an adequate solution for 

15 providing autoprocessing polypeptides optimal for protein engineering applications. 

There is thus a widely recognized need for, and it would be highly 
advantageous to have, autoprqcessing polypeptides devoid of the above limitation. 

SUMMARY OF THE INVENTION 

20 According to one aspect of the present invention there is provided an. chimeric 

polypeptide comprising an autoprocessing segment having an amino acid sequence set 
forth by SEQ ID NO: 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 
26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 
49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 

25 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 
95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105 or 106, the polypeptide being capable 
of auto-cleavage. 

According to another aspect of the present invention there is provided a 
polynucleotide encoding a chimeric polypeptide comprising an autoprocessing 
30 segment having an amino acid sequence set forth by SEQ ID NO: 8, 9, 10, 11,12, 13, 
14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 
37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 
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60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 
83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 
104, 105 or 106, the polypeptide being capable of auto-cleavage. 

According to further features in preferred embodiments of the invention 
5 described below, the chimeric polypeptide further comprises an affinity tag capable of 
specifically binding a substrate. 

According to still further features in preferred embodiments, the substratg~is * 
selected from the group consisting of a molecule, a compound, a virus, and a cell. 

According to yet another aspect of the present invention there is provided a 
10 nucleic acid construct comprising a nucleic acid sequence encoding a chimeric 
polypeptide comprising an autoprocessing segment having an amino acid sequence set 
forth by SEQ ID NO: 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 
26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 
49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 
15 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 
95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105 or 106, the chimeric polypeptide 
being capable of auto-cleavage. 

According to further features in preferred embodiments of the invention 
described below, the nucleic acid construct further comprises a promoter sequence 
20 being for directing expression of the chimeric polypeptide in an expression system. 

According to still further features in preferred embodiments, the chimeric 
polypeptide further comprises an affinity tag capable of specifically binding a specific 
substrate. 

According to still another aspect of the present invention there is provided a 
25 method of generating a chimeric polypeptide capable of displaying auto-cleavage, the 
method comprising generating a chimeric amino acid sequence including an 
autoprocessing segment, the autoprocessing segment having an amino acid sequence 
set forth by SEQ ID NO: 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 
25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 
30 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 
71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 
94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105 or 106, thereby producing the 
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chimeric polypeptide capable of displaying auto-cleavage. 

According to further features in preferred embodiments of the invention 
described below, the chimeric polypeptide includes an affinity tag. 

According to a further aspect of the present invention there is provided a 
5 method of purifying a protein, the method comprising: (a) generating a chimeric 
polypeptide including an autoprocessing segment having an amino acid sequence set 
forth by SEQ ID NO: 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 
26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 
49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 
10 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 
95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105 or 106, the autoprocessing segment 
being terminally attached to, or flanked by, an amino acid sequence of the protein, the 
chimeric polypeptide being capable of auto-cleavage when subjected to suitable 
conditions to thereby remove the amino acid sequence of the protein from the 
15 chimeric polypeptide thereby generating the protein; (b) immobilizing the chimeric 
polypeptide to a support; and (c) subjecting the chimeric polypeptide to the suitable 
conditions, thereby purifying the protein. 

According to further features in preferred embodiments of the invention 
described below, the method of purifying a protein further comprises the step of 
20 separating the protein from the autoprocessing segment following step (c). 

According to still further features in preferred embodiments, the support 
includes an antibody or antibody fragment capable of specifically binding the 
autoprocessing segment, and the immobilizing is via the autoprocessing segment. 

According to still further features in preferred embodiments, the chimeric 
25 polypeptide further includes an affinity tag sequence, and the immobilizing is via the 
affinity tag sequence. 

According to still further features in preferred embodiments, the support 
includes a specific ligand of the affinity tag sequence, and the immobilizing is via the 
specific ligand of the affinity tag sequence. 
30 According to still further features in preferred embodiments, the generating the 

chimeric polypeptide is effected by synthesizing a polynucleotide encoding the 
chimeric polypeptide and expressing the polynucleotide in an expression system. 
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According to still further features in preferred embodiments, the expression 
system is a cellular expression system or a cell-free expression system. 

According to still further features in preferred embodiments, the cellular 
expression system is an E. coli cellular expression system. 
5 According to still further features in preferred embodiments, the cell-free 

expression system is an E. coli S30 extract expression system. 

According to still further features in preferred embodiments, the 
polynucleotide comprises a promoter sequence being for directing the expression of 
the chimeric polypeptide. 
10 According to still further features in preferred embodiments, the promoter 

sequence is inducible by isopropyl beta-D-thiogalactoside. 

According to still further features in preferred embodiments, the auto-cleavage 
results in auto-splicing. 

According to still further features in preferred embodiments, the auto-splicing 
15 is auto-splicing of segments of the chimeric polypeptide flanking the autoprocessing 
segment. 

According to a further aspect of the present invention there is provided a 
method of reversibly attaching a first substrate to a second substrate, the method 
comprising: (a) providing a chimeric polypeptide including an autoprocessing 

20 segment having an amino acid sequence set forth by SEQ ID NO: 8, 9, 10, 1 1, 12, 13, 
14, 15, 16, 17/18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 
37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 
60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 
83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 

25 104, 105 or 106 flanked by a first amino acid sequence capiable of binding the first 
substrate and a second amino acid sequence capable of binding the second substrate, 
the chimeric polypeptide being capable of auto-cleavage when subjected to suitable 
conditions, to thereby release the first amino acid sequence from the second amino 
acid sequence; (b) exposing the first substrate and the second substrate to the chimeric 

30 polypeptide, thereby generating a complex including the first substrate attached via 
the chimeric polypeptide to the second substrate; and (c) subjecting the complex to the 
suitable conditions, thereby detaching the first substrate from the second substrate. 
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According to further features in preferred embodiments of the invention 
described below, each of the first and second substrates is independently selected 
from the group consisting of a molecule, a compound, a virus, and a cell. 

According to still further features in preferred embodiments, the molecule is 
5 amylose or chitin. 

• According to still further features in preferred embodiments, the virus is a 
bacteriophage. 

According to still further features in preferred embodiments, the chimeric 
polypeptide includes an affinity tag sequence, and the binding the first substrate or the 
10 binding the second substrate is via the affinity tag sequence. 

According to still further features in preferred embodiments, the affinity tag 
sequence is a maltose-binding domain or a chitin-binding domain. 

According to still further features in preferred embodiments, the 
autoprocessing segment is selected from the group consisting of BILl_cloth, 
15 BIL2_cloth, BIL3_cloth, BIL4_cloth, BIL5_cloth, BIL6_cloth, BIL7_cloth, 
BIL9_cloth, BIL10_cloth, BILll_cloth, 3875_87_magma, FhaBmanha, 
BIL2_neigo, BIL3_neigo, BDL5_neigo, BIL6_neigo, MafBl_neigo, MafB2_neigo, 
B0369+_neimeB, B0372+_neimeB, B0655+_neimeB, A2115_neime, BD.2_neimeC, 
BIL3_neimeC, BIL4_neimeC, BIL5_neimeC, BIL6_neimeC, MafBl_neimeC, 
20 FhaBl_psefl-PfO-l, FhaBl_psefl-SBW25, FhaB jpsesy, SCP1.201_strco, 
39_9_thefus, BILl_gemob, BIL2_gemob, 0709_lepin, 3725_lepin, 3719_lepin, 
o665_myxxa, ol078_myxxa, ol070_myxxa, BILl_strav, BIL2_strav, BIL3_strav, 
BILl_pirsp, BILl_chrvi, BIL1 _glovi, BIL2_glovi, BIL3_glovi, BIL4_glovi, 
BIL5_glovi, BIL6_glovi, BIL7_glovi, o649_versp, o5687_versp, o3395_versp, 

■ 

25 II0519_brume, BIL2_magma, BIL3_magma, BIL4_magma, BIL5_magma, 
BIL6_magma, 06786_metex, 00126_rhoca, 00199_rhoca, 00459_rhoca, 00460_rhoca, 
00746_rhoca, 00949_rhoca, 01216_rhoca, 01374_rhoca, 01523_rhoca, 01524_rhoca, 
02710_rhoca, 03530_rhoca, 4825_rhosp, BIL2_rhosp, BILl_silpo, BIL2_silpo, 
BIL3_silpo, BIL4_silpo, BIL5_silpo, BIL6_.silpo, BIL7_silpo,. BIL8_silpo, 

30 BIL9_silpo, BIL10_silpo, BILH_silpo, BIL12__silpo, BIL13_silpo, BIL14_silpo, 
BIL15_silpo, BIL16_silpo, Bill_rhile, BILl_unknwn, and BIL2_unknwn. 

According to still further features in preferred embodiments, the 
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autoprocessing segment is derived from a protein of an organism belonging to a genus 
selected from the group consisting of Brucella, Clostridium, Magnetospirillum, 
Mannheimia, Methylobacterium, Neisseria, Pseudomonas, Rhodobacter, Silicibacter, 
Streptomyces, Thermobifida, Rhizobium, Chromobacterium, Myxococcus, Leptospira, 

5 Pirellula, Gemmata, Gloeobacter and Verrucomicrobium. 

According to still further features in preferred embodiments, the organism is 
selected from the group consisting of Rhodobacter capsulatus, Rhodobacter 
sphaeroides, Silicibacter pomeroyi, Brucella melitensis, Brucella suis, 
Magnetospirillum magnetotacticwn, Methylobacterium extorquens, Rhizobium 

10 leguminosarum, Neisseria meningitidis, Neisseria meningitidis, Neisseria 
meningitidis, Neisseria gonorrhoeae, Chromobacterium violaceum, Pseudomonas 
syringae, Pseudomonas fluorescens, Pseudomonas fluorescens, Mannheimia 
haemolytica, Myxococcus xanthus, Leptospira interrogans, Streptomyces coelicolor, 
Streptomyces avermitilis, Thermobifida fusca, Clostridium thermocellum, Pirellula 

15 species 1, Gemmata obscuriglobus, Gloeobacter violaceus, and Verrucomicrobium 
spinosum. 

According to still further features in preferred embodiments, the auto-cleavage 
results in removal of a segment of the chimeric polypeptide adjacent to an amino 
terminal end or a carboxy terminal end of the autoprocessing segment. 
20 According to still further features in preferred embodiments, the segment of 

the chimeric polypeptide adjacent to the autoprocessing segment is an amino terminal 
segment or a carboxy terminal segment of the chimeric polypeptide. 

According to still further features in preferred embodiments, the segment of 
the chimeric polypeptide adjacent to the carboxy terminal end of the autoprocessing 
25 segment includes an amino acid residue comprising a nucleophilic group at an amino 
terminal end thereof. 

According to still further features in preferred embodiments, the nucleophilic 
group is a hydroxyl group. 

According to still further features in preferred embodiments, the amino acid 
30 residue is a threonine residue. 

■ « 

According to still further features in preferred embodiments, the segment of 
the chimeric polypeptide adjacent to the amino terminal end of the autoprocessing 
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segment includes a serine amino acid residue at a carboxy terminal end thereof. 

According to still further features in preferred embodiments, the chimeric 
polypeptide is capable of the auto-cleavage under a condition selected from the group 
consisting of a temperature selected from a range of 33 °C to 41 °C, a pH selected 

5 from a range of pH 7.8 to pH 8.2, and a concentration of dithiothreitol selected from a 
range of 0. 1 ; mM to 20 mM. 

The present invention successfully addresses the shortcomings of the presently 
known configurations by providing novel chimeric polypeptides capable of defined 
auto-cleaving, including auto-cleaving resulting in defined auto-splicing, 

10 polynucleotides suitable for expressing such polypeptides, and methods of using such 
polypeptides and polynucleotides to purify proteins, affinity-select display phages, 
and post-translationally ligate proteins together. 

Unless otherwise defined, all technical and scientific terms used herein have 
the same meaning as commonly understood by one of ordinary skill in the art to 

15 which this invention belongs. Although methods and materials similar or equivalent 
to those described herein can be used in the practice or testing of the present 
invention, suitable methods and materials are described below. All publications, 
patent applications, patents, and other references mentioned herein are incorporated 
by reference in their entirety. In case of conflict, the patent specification, including 

20 definitions, will control. In addition, the materials, methods, and examples are 
illustrative only and not intended to be limiting. 



BRIEF DESCRIPTION OF THE DRAWINGS 

The invention is herein described, by way of example only, with reference to 

25 the accompanying drawings. With specific reference now to the drawings in detail, it 
is stressed that the particulars shown are by way of example and for purposes of 
illustrative discussion of the preferred embodiments of the present invention only, and 
are presented in the cause of providing what is believed to be the most useful and 
readily understood, description of the principles and conceptual aspects of the 

30 invention. In^^tfiis "regard, no attempt is made to show structural details of the 
invention in more detail than is necessary for a fundamental understanding of the 
invention, the description taken with the drawings making apparent to those skilled in 
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the art how the several forms of the invention may be embodied in practice. 
In the drawings: 

FIG. la is a sequence alignment diagram of the amino acid sequences of 
various Type A BIL domains (SEQ ID NOs: 8-62). The names of the domains and 

5 the amino acid sequence coordinates of their amino terminal residues in their host 
proteins are indicated to the left of the sequence. Numbers indicated in parentheses 
indicate the amino acid residue length of an intervening amino acid sequence which 
may have any amino acid sequence. Dashed lines have been inserted to assist in 
visualizing the alignments. 

10 FIG. lb is a sequence alignment diagram of the amino acid sequences of 

various Type B BIL domains (SEQ ID NOs: 63-104). The names of the domains and 
the amino acid sequence coordinates of their amino terminal residues in their host 
proteins are indicated to the left of the sequence. Numbers indicated in parentheses 
indicate the amino acid residue length of an intervening amino acid sequence which 

15 may have any amino acid sequence. Dashed lines have been inserted to assist in 
visualizing fee alignments. 

FIG. 2a is a sequence diagram depicting an amino acid sequence motif (SEQ 
ID NO: 105) exclusively defining amino acid sequences of a subset of Type A BIL 
domains, including: 39__9_thefus, SCP1.201_strco, 3875_87jmagma, 

20 B0372+_neimeB, B06554_neimeB, A2115_neime, MafBl_neimeC, BIL2_neimeC 3 
BIL4 neimeC, BIL5_neimeC, BIL6_neimeC, MafBl neigo, BIL2_neigo, 
BIL3_neigo, MafB2_neigo, BIL5_neigo, BIL6_neigo, FhaB _psesy, FhaBmanha, 
FhaBl_psefl-PfO-l, FhaBl_psefl-SBW25, BIL6_cloth, BIL5_cloth, BIL2_cloth, 
BIL4_cloth, BILl_cloth, BIL8_cloth, BIL9_cloth, BILl_gemob, BIL2_gemob, 

25 0709Jepin, 3725_lepin, ol078_myxxa, ol070_myxxa, BILl_strav, BIL2_strav, 
BIL3_strav 3 BILl_pirsp, BILl_chrvi, o3395 versp, o5687_versp, o649_versp, 
BIL1 glovi, BIL2_glovi, BIL3_glovi, and BIL4_glovi. Amino acid residues are 
indicated in standard single-letter code. X - any amino acid; X( 1-100) - any amino 

■ 

acid sequence composed of 1 to 100 amino acid residues. 
30 FIG. 2b is a sequence diagram depicting an amino acid sequence motif (SEQ 

ID NO: 106) exclusively defining amino acid sequences of a subset of Type B BIL 
•domains, including: 4825_rhosp, BIL2_rhosp, 00588jrhoca, 02710_rhoca, 
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01524_rhoca, 01523_rhoca, 00126_rhoca, 01216_rhoca, 00949_rhoca, 01374_rhoca, 
00459_rhoca, 00460_rhoca, 00746_rhoca, 03530_rhoca, 00199_rhoca, BEL3_magma, 
BIL4_magma, BILlbrusu, BILl_unknwn, BIL2_unknwn, 06786_metex, 
BILl_silpo, BIL2_silpo, BIL3_silpo, BIL4_silpo, BIL5_silpo, BIL6_silpo, 
5 BIL7_silpo, BIL8_silpo, BIL9_silpo, BIL10_silpo, BILll_silpo, BIL12_silpo, 
BIL1 3_silpp, BIL14_silpo, BIL1 5_silpo,* BIL1 6_silpo, BILl_rhile, and 
II0519_brume. Amino acid residues are indicated in standard single-letter code. X - 
any amino acid; X(l-100) - any amino acid sequence composed of 1 to 100 amino 
acid residues. 

10 FIGs. 3a-z are block-logo diagrams depicting conserved amino acids of Hint- 

like motifs of Type A BIL domains (Figures 3a-g) and Type B BIL domains (Figures 
3h-o) and of homologous Hint motifs of Hog proteins (Figures 3p-t) and inteins 
(Figures 3u-z). Unique BIL domain motifs are underlined with hatched lines. Motifs 
are ordered left to right in the amino to carboxy terminal positions along the protein 

15 sequences. Similar motifs are vertically aligned. The motifs are shown as sequence 
logos in which the heights of amino acid letter designations are proportional to their 
degree of conservation in each position. Protein splicing active site residues of intein 
Hint domains are indicated by asterisks. Motifs were identified and are displayed as 
previously described (Pietrokovski S., 1998. Protein Sci. 7, 64). The BIL domain 

20 motifs shown include sequences described in Figures la-b and Tables 1-2. The intein 
and hedgehog Hint domain sequences were obtained from previously published 
sources (Aspock G., 1999. Genome Res. 9, 909; Pietrokovski S., 2001. Trends Genet. 
17, 465). Hie amino acid residue position shown forming the carboxy terminal end of 
the motif depicted in Figure 3z actually represents the amino acid residue forming the 

25 amino terminal end of the carboxy terminal extein. Only intein and hedgehog motifs 
common to Hint domains are shown. 

FIGs. 4a-b are dendrograms depicting phylogenetic relationships of Type A 
and Type B BIL domains, respectively. 

FIGs. 5a-b are autoradiographs depicting SDS-PAGE separation of in-vitro 

30 translated, [ 35 S]-methionine-labeled, protein products of the MBP-BIL-CBD 
expression constructs pC2C-PsyBIL, and pC2C-RspBIL2 (Figures 5 a and 5b, 
respectively, "MBC" lane). Translation of control constructs pC2C ("MC" lane) was 
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used as a positive control and no DNA template ("— " lane) were used as a negative 
control. Molecular weights were estimated using translation products of pBEST/wc as 
reference standards. In Figure 5a, molecular weights ("mw" lane) were further 
estimated using unlabeled protein markers (dotted lines). In Figure 5b [MBP-CBD]' 

5 and [MBP]' indicate splicing and amino terminal cleavage products derived from 
MBP-RspBIL2-CBD. These fragments migrated at molecular weights greater than 
those the corresponding control fragments due to their containing residual 
BIL2_rhosp-flanking amino acid residues as a result of the pC2C-RspBIL2 cloning 
scheme described under Materials and Methods. Expected molecular weights of 

10 identified fragments are indicated in parentheses. The expected molecular weights of 
MBP-PsyBIL-CBD and of its carboxy terminal cleavage product MBP-PsyBIL, and 
its MBP-CBD splicing product were 66.3 and 59.1, and 50.6 kDa, respectively. 

FIG. 5c is a diagram of the amino acid sequence of FhaBjpsesy (SEQ ID NO: 
107) depicting the putative protein splicing motifs (underlined) and catalytic residues 

15 (double-underlined) responsible for auto-cleavage/-splicing activity by this Type A 
BIL domain. 

FIGs. 6a-b are photographs depicting SDS-PAGE analysis of autocatalytically 
processed protein products overexpressed in-vivo in E. coli transformed with the 
MBP-PsyBIL-CBD ("MBC" lane) expression construct pC2C-PsyBIL. Figure 6a 

20 depicts Coomassie Blue staining of chitin (lane 2) or amy lose (lane 3) affinity column 
separated protein. Lane 4 is a control showing protein from E. coli transformed with 
pC2C to overexpress MBP-CBD ("MC" lane) chimeric protein. Fragments 
corresponding to the MBP-PsyBIL carboxy terminal auto-cleavage product and the 
MBP-CBD autOTSplicing product of the chimera are indicated. Figure 6b is an 

25 autoradiograph depicting Western immunoblotting analysis of separated protein 
following purification on chitin beads. Both chitin purified samples from E. coli 
transformed with pC2C-PsyBIL or control plasmid pC2C to overexpress MBP- 
PsyBIL-CBD or MBP-CBD chimeric proteins, respectively, were separated in 
duplicate Jianes* and blotted onto a single nitrocellulose membrane. The membrane 

30 was cut in half and each sample was reacted in duplicate with either anti-MBP (anti- 
M) or anti-CBD (anti-C) antibodies. Both anti-M and anti-C antibodies reacted with 
the protein band corresponding to the mass of the MBP-CBD product. Protein bands 
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corresponding to MBP-PsyBIL and MBP products, that appear following purification 
on chitin beads result from non specific binding by excess amounts of overexpressed 
protein. Expected molecular weights of identified fragments are indicated in 
parentheses. The expected molecular weights of MBP-PsyBIL-CBD and of its 



5 carboxy terminal cleavage product MBP-PsyBIL, and its MBP-CBD splicing product 
were 66.3 and 59.1, and 50.6 kDa, respectively. 



ligation product (Figure 6c) and MBP-PsyBIL carboxy terminal, cleavage product 
(Figure 6d) electroeluted from SDS-PAGE gels. The expected molecular weights of 

10 MBP-PsyBIL-CBD and of its carboxy terminal cleavage product MBP-PsyBIL, and 
its MBP-CBD splicing product were 66.3 and 59.1, and 50.6 kDa, respectively. 

FIG. 7 is an amino acid sequence diagram depicting positioning of peptide 
sequences identified by MALDI peptide mass mapping analysis within the amino acid 
sequence of the MBP-CBD splicing product (SEQ ID NO: 108). Twenty-five tryptic 

15 peptide masses (underlined) were assigned to the amino acid sequence of the MBP- 
CBD protein, corresponding to 49 % coverage of the MBP-CBD sequence. Lettering 
in non-bold/italic font indicates the amino acid sequence of the MBP tag, and the 
amino acid sequence of the CBD tag (amino acids 394—461) is indicated in bold+italic 
font. The peptide corresponding to amino acids 388-396 contains the BIL domain 

20 splice site between amino acids Ser393 and Thr394. 



Underlined sequences correspond to peptides detected by MALDI. Lettering in non^ 
bold/italic font indicates the amino acid sequence of the MBP tag and that in 

25 bold/italic font indicates the amino acid sequence of the PsyBIL domain. The carboxy 
terminal end of the protein, asparagine N541 represents the carboxy terminal of the 
PsyBIL domain. The expected molecular weights of MBP-PsyBIL-CBD and of its 
carboxy terminal cleavage product MBP-PsyBIL, and its MBP-CBD splicing product 
were 66.3 and 59.1, and 50.6 kDa, respectively. 

30 FIG. 9 is a schematic diagram depicting functions of BIL domains. Hint 

domains are shown as dark gray horseshoes with their flanks as ovals. Proteins are 
depicted with amino termini positioned on the left. 



FIGs. 6c-d are data plots depicting MALDI mass spectra of MBP-CBD 



FIG. 8 is an amino acid sequence diagram depicting MALDI peptide mapping 

* 

of the 59.3 kDa MBP-PsyBIL carboxy terminal cleavage product (SEQ ED NO: 109). 
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FIGs. lOa-c are electrophoretic analyses depicting C-terminal auto-cleavage by 
MBP-RspBIL2a-CBD chimera. Figure 10a is a photograph of a Coomassie blue 
stained electrophoretic separation of in-vivo expressed MBP-RspBIL2a-CBD chimera 
affinity purified on amylose depicting C-terminal cleavage activity ("MB" product). 
Figure 10b is a photograph of a Western immunoblotting analysis depicting C- 
terminal cleavage activity ("MB" product) using anti-MBP antibodies (anti-M). 
Figure 10c is an autoradio graph of an SDS-PAGE separation of the in-vitro translated, 
[ 35 S]-methionine~labeled chimera depicting C-terminal cleavage activity ("MB" 
product). M - MBP specific fragment, MB - MBP-BIL specific fragment, MBC - 
intact chimera. 

FIGs. lla-b are electrophoretic analyses depicting N-terminal auto-cleavage 
by in-vivo expressed MBP-4825rhosp-CBD chimera. Figure 11a is a photograph of a 
Coomassie blue stained electrophoretic separation of chimera protein products 
depicting N-terminal cleavage activity of protein products affinity purified on chitin 
("BC" product) and affinity purified on amylose ("M" product). Figure lib is a 
photograph of a Western immunoblotting analysis depicting N-terminal cleavage 
activity using anti-CBD antibody as a probe ("BC" product) and anti-MBP antibody 
as a probe ("M" product). BC - BIL-CBD specific fragment, M - MBP specific 
fragment, MB - MBP-BIL specific fragment, MBC - intact chimera. 

FIGs. 12a-c are electrophoretic analyses depicting auto-processing and by in- 
vivo expressed MBP-BIL4_cloth~CBD chimera. Figure 12a is a photograph of a 
Western immunoblotting analysis of amylose (lane "A") or chitin (lane "C") purified 
protein products depicting auto-splicing activity using anti-CBD antibody as a probe 
("MC" product). Figure 12b is a photograph of a Western immunoblotting analysis of 
amylose (lane "A") or chitin (lane "C") purified protein products depicting auto- 
splicing activity ("MC" product). Figure 12b also shows carboxy terminal auto- 
cleavage of protein products affinity purified via amylose based affinity 
chromatography (lane "A",' "MB" product). Figure 12c is a photograph of a 
Coomassie blue stained electrophoretic separation of protein products isolated via 
amylose based (lane "A") or chitin based (lane "C") affinity chromatography 
depicting auto-splicing activity ("MC" species). Figure 12c also shows carboxy 
terminal auto-cleavage of protein products affinity purified Via amylose based affinity 
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chromatography (lane "A", "MB" species). Note the very small amounts of the 
uncleaved precursor (lane "C", "MBC" species) suggesting very efficient 
autoprocessing activity by this chimera. M - MBP specific fragment, MB - MBP- 
BIL specific fragment, MBC - intact chimera. 

5 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention is of chimeric autoprocessing polypeptides, 
polynucleotides encoding such polypeptides, and uses of such polypeptides and 
polynucleotides for reversibly binding proteins to specific substrates, reversibly 
10 binding specific substrates to each other, and auto-splicing amino acid sequences. 
Specifically, the present invention can be used to purify proteins, to affinity-select 
display phages, and to post-translationally ligate proteins together. 

Before explaining at least one embodiment of the invention in detail, it is to be 
understood that the invention is not limited in its application to the details set forth in 
15 the following description or exemplified by the Examples. The invention is capable 
of other embodiments or of being practiced or carried out in various ways. Also, it is 
to be understood that the phraseology and terminology employed herein is for the 
purpose of description and should not be regarded as limiting. 

Autoprocessing polypeptides, polypeptides having the capacity to post- 
20 translationally auto-cleave and/or auto-splice, can be used to greatly facilitate various 
industrially and scientifically important biochemical procedures, as described above. 
For example, such autoprocessing polypeptides can be used in applications which 
involve reversible binding of proteins, to specific substrates, such as protein 
purification, and reversible binding of specific substrates to each other, such as 
25 affinity-selection of display phages. 

Various chimeric autoprocessing polypeptides and methods of using such have 
been described in the prior art (reviewed in: Perler and Adam, 2000. Curr Opin 
Biotechnol. 1 1, 377-83; Paulus H., 2000. Annu Rev Biochem. 69, 447). 

However, all such prior art chimeric autoprocessing polypeptides suffer from 
30 various drawbacks. As described above, these drawbacks include suboptimal activity, 
poor stability, insolubility, requirement for strong auxiliary nucleophiles causing 
undesirable modifications at carboxy termini of cleaved amino terminal fragments, 



WO 2004/043997 





/EL2003/000956 



18 



and undesirable amino terminal cleavages. 

Thus, all prior art approaches have failed to provide optimal chimeric 
autoprocessing polypeptides for use in protein engineering. 

While reducing the present invention to practice it was uncovered that the 
5 chimeric polypeptides of the present invention display efficient auto-cleavage, 
including auto-cleavage resulting in auto-splicing. 

The chimeric polypeptides of the present invention comprise novel 
autoprocessing domains characterized by unique amino acid sequences, unique host 
protein/organism type origins, and unique natural biological capacities. Thus, the 
10 chimeric polypeptides of the present invention are highly novel and significantly 
enlarge and enhance the prior art spectrum of available types of chimeric 
autoprocessing polypeptides and their possible applications. 

Thus, according to one aspect of the present invention, there are provided 
chimeric polypeptides having efficient auto-cleavage activity and comprising an 



15 autoprocessing segment having an amino acid sequence set forth by SEQ ID NO: 8, 9, 
10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21,22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 
33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 
56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 
79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 

20 101, 102, 103, 104, 105 or 106. 

Preferably, the amino acid sequence of the autoprocessing segment is set forth 
by SEQ ID NO: 12, 31, 76, or 77. 



25 segment. Auto-cleavage occurs following exposure of the polypeptide of the present 
invention to suitable conditions in the absence of any other protein. Suitable auto- 
cleavage conditions are described hereinbelow. 

Depending on the purpose, application and configuration, the polypeptides of 



invention results in removal of a segment of the polypeptide adjacent to the amino 
terminal end or the carboxy terminal end of the autoprocessing segment. 



As used herein, the phrase "auto-cleavage activity" refers to cleavage of a 
polypeptide of the present invention in a region adjacent to the autoprocessing 



30 



the present invention may display different types of auto-cleavage activity. 

Preferably, the auto-cleavage activity of the polypeptides of the present 
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Preferably, the segment of the polypeptide adjacent to the autoprocessing 
segment is an amino terminal segment or a carboxy terminal segment of the 
polypeptide. 

According to one embodiment, the polypeptides of the present invention may 
5 comprise the autoprocessing domain as an amino terminal segment thereof. In this 
configuration, the polypeptides of the present invention may display removal of the 
segment thereof adjacent to the carboxy terminal end of the autoprocessing segment, 
i.e., the segment removed is the carboxy terminal segment of the polypeptide. 

According to further embodiments, the polypeptides of the present invention 
10 may comprise the autoprocessing domain as a carboxy terminal segment thereof. In 
this configuration, the polypeptides of the present invention may display removal of 
the segment thereof adjacent to the amino terminal end of the autoprocessing segment, 
i.e., the segment removed is the amino terminal segment of the polypeptide. 

According to yet further embodiments, the polypeptides of the present 
15 invention may comprise the autoprocessing domain as an internal segment of the 
polypeptide. In this configuration, the polypeptides of the present invention may 
display one or more of the following: removal of a segment adjacent to the amino 
terminal end of the autoprocessing segment, (i.e., the amino terminal segment of the 
polypeptide); removal of a segment adjacent to the carboxy terminal end thereof (i.e., 
20 the carboxy terminal segment of the polypeptide); removal of both the carboxy and 
amino terminal segments. 

In some cases, removal of both segments may result in subsequent covalent 
fusion between the removed segments and, as such, auto-splicing of the polypeptide. 

As used herein, the term "auto-splicing" refers to covalent bond formation 
25 between the amino acid residue forming the carboxy terminal end of a segment of the 
polypeptide adjacent to the amino terminal end of the autoprocessing domain and the 
amino acid residue forming the amino terminal end of a segment of the polypeptide 
adjacent to the carboxy terminal end of the autoprocessing domain. 

As is illustrated in the Examples section which follows, the polypeptides of the 
30 present invention demonstrate such auto-splicing activity. 

The polypeptides of the present invention may be advantageously used to post- 
translationally ligate essentially any protein to essentially any other protein via 
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formation of a covalent bond between amino acid residues forming complementary 
terminal ends thereof. This may be effected by using a polypeptide of the present 
invention comprising the proteins to be ligated in the configuration described 
hereinabove enabling auto-splicing to yield the desired ligation product. 



above, the qovalent bond may be an ester bond, such as the ester bond formed during 
auto-cleavage of prior art autoprocessing polypeptides. 

Depending on the application and purpose, auto-cleavage of the polypeptide 
may be specifically induced under suitable conditions, preferably a specific 
10 temperature, pH, or concentration of dithiothreitol (DTT). 

Preferably, the temperature is in the range of 33 to 41 °C, more preferably the 
temperature is in the range of 34 to 40 °C, more preferably the temperature is in the 
range of 35 to 39 °C, more preferably the temperature is in the range of 36 to 38 °C, 
more preferably the temperature is in the range of 36.5 to 37.5 °C, and most 
15 preferably the temperature is 37.0 °C. 

Preferably, the pH is in the range of pH 7.8 to 8.2, more preferably the pH is in 
the range of pH 7.9 to 8.1, and most preferably the pH is 8.0. 

Preferably, the concentration of dithiothreitol is in the range of 0.1-20 
millimolar; more preferably the concentration of dithiothreitol is in the range of 0.2— 



20 10 millimolar, more preferably the concentration of dithiothreitol is in the range of 
0.5—5 millimolar, and most preferably the concentration of dithiothreitol is in the 



Without being bound to a paradigm, the present inventors are of the opinion 
that auto-cleavage activity, in addition to being governed by the amino acid sequence 
25 of the autoprocessing segment, is also influenced by the amino acid sequence of the 
segments adjacent to the autoprocessing segment. 

For example, the present inventors are of the opinion that in configurations in 
which the polypeptide comprises a carboxy terminal segment adjacent to the carboxy 
terminal end of the autoprocessing segment, the efficiency of cleavage may be 
30 enhanced if such a carboxy terminal segment includes at its amino terminal end, an 

a 

amino acid residue comprising a nucleophilic group such as a sulfhydryl group, or 
more preferably a hydroxy! group. 



5 



Preferably, the covalent bond is a peptide bond. Alternately, as described 



range of 1 .0-2.0 millimolar. 
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As such, in cases wherein the polypeptide of the present invention includes a 
carboxy terminal segment adjacent to the autoprocessing segment, the amino acid 
residue forming the amino terminal end of such a carboxy terminal segment is 
preferably cysteine, serine, or more preferably threonine. As is illustrated in the 

5 Examples section below, polypeptides of the present invention having such a carboxy 
terminal segment display auto-cleavage of the carboxy terminal segment, including 
auto-cleavage resulting in auto-splicing. 

In configurations wherein the polypeptide of the present invention includes an 
amino terminal segment adjacent to the autoprocessing segment, the amino acid 

10 residue forming the carboxy terminal end of such an amino terminal segment is 
preferably serine. As is illustrated in the Examples section below, polypeptides of the 
present invention having such an amino terminal segment display auto-cleavage of the 
amino terminal segment, including auto-cleavage resulting in auto-splicing. 

It is recognized in the art that certain prior art autoprocessing segments lack 

15 the capacity to auto-cleave the bond attaching a given terminal end of the 
autoprocessing segment to particular flanking amino acid residues (Perler and Adam, 
2000. Curr Opin Biotechnol. 1 1, 377-83). 

In particular, certain prior art autoprocessing amino acid sequences lack the 
capacity to auto-cleave the bond attaching their amino terminal end to a flanking 

20 serine residue. As is illustrated in the Examples section which follows, and in sharp 
contrast to polypeptides containing such prior art autoprocessing amino acid 
sequences, the polypeptides of the present invention possess the capacity to auto- 
cleave such a bond. 

Specific amino acid sequences of autoprocessing domains comprised in the 
25 polypeptides of the present invention may be obtained by referring to Table 1 , Figures 
la-b, and Figures 2a-b of the Examples section below. Table 1 provides database 

can be used to retrieve the nucleic acid sequences encoding such 
amino acid sequences. Such amino acid sequences may easily be determined from 
such nucleic acid sequences by the ordinarily skilled artisan using a suitable nucleic 
30 acid-to-ammo acid sequence translation software, such, as, for example translation 
software made publicly available by the National Center for Biotechnology 
Information (NCBI) or the European Molecular Biology Laboratory (EMBL) on the 
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World Wide Web (WWW). Figures la-b provide specific amino acid sequences of 
various autoprocessing domains comprised in the polypeptides of the present 
invention. Figures 2a-b provide amino acid sequence motifs which define 
autoprocessing domains comprised in the polypeptides of the present invention. 

5 For example, as is shown in the Examples section which follows, polypeptides 

of the present invention comprising autoprocessing segments FhaB_psesy (SEQ ID 
NO: 12), BIL4_cloth (SEQ ID NO: 31), 4825_rhosp (SEQ ID NO: 76) or 
BIL2_rhosp (SEQ ID NO: 77) display auto-cleavage activity, including auto-cleavage 
activity resulting in auto-splicing. 

10 The polypeptides of the present invention may further comprise at least one 

affinity tag capable of specifically binding a substrate. As is further described 
hereinbelow, the affinity tag type depends on the intended use of the polypeptide. 

As used herein, the phrase "affinity tag" refers to any moiety (preferably a 
peptide or polypeptide moiety) which is capable of specifically binding a substrate. 

15 While the substrate can be essentially any substance or particle which can be 

specifically bound by the affinity tag, the substrate is preferably a molecule, a 
compound, a virus, or a cell. 

The polypeptides of the present invention may comprise essentially any 
affinity tag. 

20 Examples of peptide/polypeptide affinity tags include streptavidin, His-tags, 

strep-tags, epitope tags, maltose-binding proteins, and chitin-binding domains. 

His-tags (histidine tags) consist of a chain of 2 to 10, most preferably 6, 
contiguous histidine amino acid residues. His-tags have the capacity to specifically 
bind substrates including nickel. Ample guidance regarding tagging polypeptides 

25 with His-tags is available in the literature of the art (for example, refer to: Sheibaiii N. 
1999. Prep Biochem Biotechnol. 29:77). Purification of molecules comprising 
histidine tags is routinely effected using nickel-based automatic affinity column 
purification techniques. A suitable capture ligand for histidine-tagged molecules is 
the anti histidine tag single chain antibody 3D5 (Kaufmann, M. et aL, 2002. J Mol 

30 Biol. 318. 135-47). 

Examples of epitope tags include an 1 1 -mer Herpes simplex virus glycoprotein 
D peptide, and an 11 -mer N-terminal bacteriophage t7 peptide, being commercially 
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known as HSVTag and t7Tag, respectively (Novagen, Madison, WI, USA), and 10- or 
9-amino acid c-myc or Hemophilus influenza hemagglutinin (HA) peptides, which are 
recognized by the variable regions of monoclonal antibodies 9E10 and 12Ca5, 
respectively. 

5 Strep-tags are peptides having the capacity to specifically bind streptavidin. 

Ample guidance regarding the use of strep-tags is provided in the literature of the art 
(see, for example: Schmidt, TGM. and Skerra, A. 1993. Protein Eng. 6, 109; Schmidt 
TGM. et al y 1996. Journal of Molecular Biology 255, 753-766; Skerra A. and 
Schmidt TGM., 1999. Biomolecular Engineering 16, 79-86; Sano T. and Cantor CR. 

10 2000. Methods Enzymol. 326, 305-11; Sano T. et aL, 1998. Journal of 
Chromatography B 715, 85-91). 

Preferably, the affinity tag is a maltose-binding domain or a chitin-binding 
domain. 

Preferably, the maltose-binding domain is maZE-encoded maltose-binding 
1 5 protein (MBP). Ample guidance regarding the use of maltose-binding protein as an 
affinity tag is provided in the Examples section which follows and in the literature of 
the art (see, for example: Guan M. et aL, 2002. Protein Expr Purif 26, 229-34; 
Cattoli F. and Sarti GC, 2002. Biotechnol Prog. 18, 94-100). 

In cases where the affinity tag is a maltose-binding protein, the substrate is 
20 preferably amylose, a specific ligand of such an affinity tag. Alternately, the substrate 
may be maltose, also a specific ligand of such an affinity tag. 

As is shown in the Examples section below, polypeptides of the present 
invention comprising maltose-binding protein (MBP) can specifically bind a support 
including ainylose. 

25 Preferably, the chitin-binding domain is B, eirculans c&rf-encoded chitin 

binding domain (CBD). Ample guidance regarding the use of chitin-binding domain 
as an affinity tag is provided in the Examples section which follows and in the 
literature of the art (see, for example: Humphries HE. et aL, 2002. Protein Expr Purif. 
26, 243-8; Chong S. et aL, 1997. Gene 192, 271-81). 

30 In cases where the affinity tag is cZ?<i-encoded chitin binding domain (CBD), 

the substrate is preferably chitin, a specific ligand of such an affinity tag. 

As is illustrated in the Examples section which follows, polypeptides of the 



WO 2004/043997 ^piYIL20 03/00 0956 

24 

present invention comprising cM-encoded chitin-binding domain can specifically 
bind a support including chitin. 

The polypeptides of the present invention can be generated using chemical 
synthesis approaches or preferably recombinant techniques. 

5 While reducing the present invention to practice, nucleic acid sequences 

encoding polypeptides having putative autoprocessing segments were identified in 
nucleic acid sequence databases (see Examples section below for further detail). The 
sequences were analyzed and the autoprocessing segment encoding regions were 
identified and used to generate polynucleotides encoding the polypeptides of the 

1 0 present invention . 

Thus, according to another aspect of the present invention there is provided a 
polynucleotide sequence which encodes the auto-cleavable polypeptide of the present 
invention. 

The polynucleotides of the present invention can be assembled from genomic, 
1 5 and/or complementary sequences . 

As used herein, the phrase "complementary sequence" refers to a 
polynucleotide having a nucleic acid sequence resulting from reverse transcription of 
messenger RNA using a reverse transcriptase or any other RNA dependent DNA 
polymerase. Such sequences can be subsequently amplified in-vivo or in-vitro using a 
20 DNA dependent DNA polymerase. 

As used herein, the phrase "genomic sequence" refers to a polynucleotide 
derived from a chromosome which thus reflects a contiguous portion of a 
chromosome. 

In the case of a polynucleotide of the present invention encoding an 
25 autoprocessing domain expressed in a prokaryotic organism, the nucleic acid 
sequence encoding the autoprocessing domain may be conveniently generated via 
polymerase chain reaction (PCR) amplification using genomic DNA of the 
prokaryotic organism as a template. 

Alternately, in the case of a polynucleotide of the present invention encoding 
30 an autoprocessing domain expressed in a eukaryotic organism, the nucleic acid 
sequence encoding the autoprocessing domain may be conveniently generated via 
PCR amplification using a cDNA library derived from the organism as a template. 
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Suitable oligonucleotide primers for PCR amplifying nucleic acid sequences 
encoding specific autoprocessing domains comprised in polypeptides of the present 
invention can be designed using the nucleic acid sequences identified by the database 
coordinates provided in Table 1 of the Examples section which follows. 
5 For example, as is illustrated in the Examples section below, oligonucleotide 

primers suitable for PCR amplifying nucleic acid sequences encoding the 
autoprocessing domains FhaBjpsesy, or BIL2 rhosp were derived from nucleic acid 
sequences retrieved using the relevant database coordinates provided in Table 1 of the 
Examples section below. The nucleic acid sequences of such primers are set forth by 
10 SEQ ID NOs: 4-5, or 6-7, respectively. Ample guidance for determining suitable 
reaction conditions for amplifying nucleic acid sequences encoding the 
aforementioned autoprocessing domains is provided in the Examples section which 
follows. PCR amplification of nucleic acid sequences is a commonly performed 
procedure and suitable primers and reaction conditions for a broad range of such 
15 procedures can generally be routinely determined by one of ordinary skill in the art, 
for example via suitable software, such as, for example, OLIGO 4.0 (National 
Biosciences, Plymouth, Minn.). 

It will be appreciated that since autoprocessing activity is a characteristic of 
the amino acid sequence, one of ordinary skill in the art may alternatively use the 
20 amino acid sequences provided herein as a template for designing nucleic acid 
sequences which encode such amino acid sequences, and which take into 
consideration parameters such as codon usage which may increase the efficiency of 
expression of such sequences in specific organisms. 

As described above, the polypeptides of the present invention may further 
25 comprise at least one affinity tag. In order to produce polypeptides of the present 
invention comprising affinity tags, coding nucleotides are formed comprising nucleic 
acid sequences encoding such affinity tags. 

As described hereinabove, methods of generating nucleic acid sequences 
encoding affinity tags are well known to one of ordinary skill in the art. For example, 
30 nucleic acid sequences encoding affinity tags can be advantageously generated by 
PCR amplification of nucleic acid sequences encoding such affinity tags. 

For example, as described and demonstrated in the Examples section which 



WO 2004/043997 ^pT/IL2003/000956 

26 

follows, suitable oligonucleotide primers for amplification of nucleic acid sequences 
encoding B. circulans cM-encoded chitin-binding domain are set forth in SEQ ID 
NOs: 1-2. 

Alternately, autoprocessing domains may be conveniently cloned into nucleic 
5 acid constructs configured for expressing fusion proteins comprising an affinity tag 
fused to a polypeptide encoded by a nucleic acid insert cloned into such a construct. 

As is shown in the Examples section below, polynucleotides of the present 
invention comprising nucleic acid sequences encoding the affinity tag maltose- 
binding protein were assembled by cloning a nucleic acid sequence encoding an 
10 autoprocessing domain of the polypeptides of the present invention into the 
expression construct pMALC2 (New England Biolabs) which is designed to express 
fusion proteins comprising maltose-binding protein fused to a polypeptide encoded by 
a cloned insert. 

Such nucleic acid constructs can be advantageously used to insert and/or 
15 express a chimeric polynucleotide within a host cell. 

Thus, according to yet another aspect of the present invention there is provided 
a nucleic acid construct comprising a polynucleotide of the present invention. 

The nucleic acid constructs of the present invention preferably comprise 
suitable promoter sequences so as to enable efficient expression of the polynucleotide 
20 of the present invention in an expression system. 

Expression of the polynucleotides of the present invention may be 
conveniently controlled using an inducible promoter. A suitable inducible promoter is 
an isopropyl beta-D-thiogalactoside (EPTG)-inducible promoter, such as a T7 
promoter. As described in the Examples section which follows, a T7 promoter can be 

* 

25 used to drive IPTG-inducible expression of a polynucleotide of the present invention 

■ 

in a suitable cell-free expression system or in a cellular expression system. 

IPTG-induced expression of polynucleotides under the regulatory control of 
T7 promoters is widely practiced in the art by the ordinarily skilled practitioner and 
ample guidance regarding the use of such promoters is available in the literature of the 
30 art (see, for example, Sambrook et al 9 infra). 

The nucleic acid constructs of the present invention can be used to produce the 
polypeptides of the present invention in a suitable expression system. 
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Thus, according to still another aspect of the present invention there is 
provided a method of generating a polypeptide of the present invention. 

The method is effected by generating a chimeric amino acid sequence 
including an autoprocessing segment of the present invention. Preferably the chimeric 
5 amino acid sequence is generated by expressing a polynucleotide of the present 
invention in an expression system suitable for generating the chimeric amino acid 
sequence from the chimeric polynucleotide. 

The nucleic acid constructs of the present invention can be used to express the 
polypeptides of the present invention in various expression systems, including any 
10 cellular or cell-free expression systems suitable for expressing recombinant proteins 
such as the polypeptides of the present invention. 

When used to express the polypeptides of the present invention in a cell-free 
expression system, the constructs of the present invention may be advantageously 
expressed in any suitable in-vitro transcription/translation system. 
15 Numerous in-vitro transcription/translation systems are commercially 

available for expressing recombinant proteins such as the polypeptides of the present 
invention. 

For example, a suitable cell-free expression system for expressing nucleic acid 
constructs of the present invention is an E. coli S30 extract expression system, as 
20 described and as demonstrated in the Examples section below. 

Numerous cellular expression systems, including yeast, bacterial, insect, and 
mammalian cells can be employed to express the nucleic acid constructs of the present 
invention. 

As described and illustrated in the Examples section which follows, the 
25 polypeptides of the present invention may be advantageously expressed in E. coli by 
transforming E. coli with the nucleic acid constructs. 

Transformation of E. coli with nucleic acid constructs is a routine procedure 
widely practiced in the art (see, for example, Sambrook et aL, infra). 

For example, for expression of the polypeptides of the present invention using 
30 the nucleic acid constructs of the present invention in E. coli 9 competent cells capable 
of DNA uptake may be prepared from cells harvested in exponential growth phase 
and rendered competent via the widely practiced CaCh method. Addition of MgCh or 
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RbCl to the transformation reaction medium may be employed to increase 
transformation efficiency. Alternative transformation methods include methods such 
as electroporation or host cell protoplast transformation. 

As described hereinabove, the capacities of the polypeptides of the present 
5 invention to specifically bind substrates and to auto-cleave can be advantageously 
used in* various practical applications involving reversible binding of substrates, such 
as protein purification. 

Thus, according to a further aspect of the present invention there is provided a 
method of purifying a protein. 
10 The method is effected by generating a polypeptide of the present invention 

■ 

comprising an autoprocessing segment being terminally attached to, or flanked by, an 
amino acid sequence of the protein, immobilizing the polypeptide to a support, and 
subjecting the immobilized polypeptide to suitable conditions for enabling auto- 
cleavage resulting in removal of the protein from the polypeptide. 

15 According to a preferred embodiment, the polypeptide of the present invention 

is configured such that one terminal end of the autoprocessing segment is adjacent to a 
terminal segment of the polypeptide being the protein to be purified and the other 
terminal end of the autoprocessing segment is adjacent to a terminal segment of the 
polypeptide comprising an affinity tag. 

20 According to this embodiment, the polypeptide of the present invention is 

preferably immobilized via a specific binding of the affinity tag to a specific ligand 
thereof included in the support. Optionally, in cases where auto-cleavage further 
results in detachment of the autoprocessing segment from the terminal segment of the 
polypeptide comprising the affinity tag, the method may advantageously further 

25 comprise the step of separating the protein from the autoprocessing segment so as to 
further facilitate purification of the protein. Such separation may be effected as 
described further hereinbelow. 

According to another embodiment, the polypeptide of the present invention 

» ■ 

consists of a chimera comprising the autoprocessing segment fused to the protein to 
30 be purified. 

According to this embodiment, the polypeptide of the present invention is 
preferably immobilized via a specific binding of the autoprocessing segment to a 
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specific ligand of the autoprocessing segment included in the support. 

According to yet another embodiment, the polypeptide of the present invention 
consists of an autoprocessing segment being flanked at its amino terminal end with an 
amino terminal segment of the amino acid sequence of the protein to be purified, and 

5 being flanked at its carboxy terminal end with the carboxy terminal segment of the 
amino acid sequence of the protein to be purified complementing the amino terminal 
segment of the amino acid sequence of the protein to be purified. 

According to this embodiment, the polypeptide of the present invention is 
preferably immobilized via a specific binding of the autoprocessing segment to a 

10 specific ligand of the autoprocessing segment included in the support, and auto- 
cleavage results in auto-splicing of the complementary amino and carboxy terminal 
segments of the protein to be purified to thereby release the protein. 

In embodiments in which the polypeptide of the present invention is 
immobilized via the autoprocessing segment, the specific ligand included in the 

15 support is preferably an antibody or antibody fragment capable of specifically binding 
the autoprocessing segment. 

Purification of a protein according to the method of the present invention may 
be advantageously effected via standard affinity chromatography techniques. For 
example, a suitable support for immobilizing a polypeptide of the present invention 

20 may be an affinity resin coupled to a specific ligand of the polypeptide of the present 

■ 

invention packed in a standard affinity purification column. Following subjecting of 
the support-bound polypeptide of the present invention to conditions suitable for auto- 
cleavage thereof, the highly purified protein released from the support-bound 
autoprocessing segment may be conveniently recovered as a flow-through fraction 

25 eluted from the column. 

In the case described hereinabove, wherein auto-cleavage further results in 
detachment of the autoprocessing segment from the support-bound segment of the 
polypeptide of the present invention, separating the protein from the autoprocessing 
segment may be effected analogously to the method described hereinabove by using 

30 such standard affinity chromatography techniques wherein the affinity resin includes a 
specific ligand of the autoprocessing segment. 

Alternately, various methods suitable for separating such mixtures of 
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polypeptides may be practiced by the ordinarily skilled artisan. Such techniques 
include, for example high-performance liquid chromatography (HPLC), size- 
exclusion chromatography, and similar methodologies. 

Ample guidance regarding chromatographic isolation of proteins is widely 

♦ 

5 available in the literature of the art (see, for example: Wilchek M. and Chaiken I. 
2000. Methods Mol Biol 147, 1-6; Jack GW., 1994. Mol Biotechnol 1, 59-86; 
Narayanan SR., 1994. Journal of Chromatography A 658, 237-258; Nisnevitch M. and 
Firer MA., 2001. J Biochem Biophys Methods 49, 467-80; Janson JC. and Kristiansen 
T. in Packings and Stationary Phases in Chromatography Techniques. Unger KK. 

10 (ed.), Marcel Dekker, New York, pp. 747 (1990); Clonis YD: HPLC of 
Macromolecules: A Practical Approach, IRL Press, Oxford, pp. 157 (1989); Nilsson J. 
era/., 1997. Protein Expr Purif. 11, 1-16). 

As described in the Examples section which follows, amylose affinity ligand 
based column chromatography of a polypeptide of the present invention comprising 

15 the autoprocessing domain BIL2_rhosp flanked at its amino terminal end with an 
amino terminal segment including the affinity tag MBP, and flanked at its carboxy 
terminal end with the amino acid sequence of a protein to be purified resulted in 
column-retention of a segment of the polypeptide of the present invention lacking the 
amino acid sequence of the protein to be purified (Figure 10a), thereby demonstrating 

20 the utility of the method of the present invention for purifying proteins. 

As described in the Examples section which follows, chitin affinity ligand 
based column chromatography of a polypeptide of the present invention comprising 
the autoprocessing domain 4825_rhosp flanked at its carboxy terminal end with an 
carboxy terminal segment including the affinity tag CBD, and flanked at its amino 

25 terminal end with the amino acid sequence of a protein to be purified resulted in 
column-retention of a segment of the polypeptide of the present invention lacking the 
amino acid sequence- of the protein to be purified (Figure 1 lb), thereby demonstrating 
the utility of the method of the present invention for purifying proteins. 

According to the teachings of the present invention, the polypeptides Of the 

30 present invention may include affinity tags flanking the autoprocessing segment. 

Thus, according to yet a further aspect of the present invention there is 
provided a method of reversibly attaching a first substrate to a second substrate. 
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The first and second substrates may be the same or may be different. 

The method is effected using a polypeptide of the present invention in which 
the autoprocessing segment is flanked by a first amino acid sequence capable of 
binding the first substrate and a second amino acid sequence capable of binding the 
5 second substrate, such that auto-cleavage releases the first amino acid sequence from 
the second amino acid sequence. Exposing the first substrate and the second substrate 
to the polypeptide of the present invention generates a complex including the first 
substrate attached via the polypeptide to the second substrate. Following generation 
thereof, the complex is subjected to suitable conditions for auto-cleavage, thereby 
10 detaching the first substrate from the second substrate. 

Complex generation may be effected in various ways, depending on the 
application and purpose. For example, complex generation may be effected wherein 
neither, one or both substrates is included in, or consists of, a support specifically 
binding an affinity tag comprised in the polypeptide of the present invention. 
15 According to a preferred embodiment, the method is used for reversibly 

attaching a first substrate being a protein displayed by a bacteriophage to a second 
substrate specifically binding the phage-displayed protein, which substrate being 
included in a support. 

According to this embodiment, the method is effected using a polypeptide of 
20 the present invention comprising a first amino acid sequence having the capacity to 
specifically bind the substrate and a second amino acid sequences having the capacity 
to bind the phage-displayed protein. 

The. method according to this aspect of the present invention may be 

i 

advantageously employed with phage-display libraries for selecting bacteriophages 
25 displaying a protein having high affinity to a specific substrate. This may be effected 
by exposing a phage display library to a support including a substrate being a target 
molecule to which a high affinity ligand is desired. Elements of the phage display 
library not being bound with high affinity to the support may be washed and recovery 
of phages specifically binding the substrate with high affinity via a displayed protein 
30 capable of specifically binding the target molecule with high affinity may be 
conveniently recovered by subjecting the support-bound phages to conditions suitable 
for auto-cleavage of the polypeptide of the present invention so as to effect the 
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detachment thereof from the support. 

Additional objects, advantages, and novel features of the present invention will 
become apparent to one ordinarily skilled in the art upon examination of the following 
5 examples, which are not intended to be limiting. Additionally, each of the various 
embodiments and aspects of the present invention as delineated hereinabove and as 
claimed in the claims section below finds experimental support in the following 
examples. 



Reference is now made to the following examples, which together with the 
above descriptions, illustrate the invention in a non limiting fashion. 



Generally, the nomenclature used herein and the laboratory procedures utilized 
in the present invention include molecular, biochemical, microbiological and 

15 recombinant DNA techniques. Such techniques are thoroughly explained in the 
literature. See, for example, "Molecular Cloning: A laboratory Manual" Sambrook et 
ah, (1989); "Current Protocols in Molecular Biology" Volumes I-III Ausubel, R. M., 
ed. (1994); Ausubel et ah, "Current Protocols in Molecular Biology", John Wiley and 
Sons, Baltimore, Maryland (1989); Perbal, "A Practical Guide to Molecular Cloning", 

20 John Wiley & Sons, New York (1988); Watson et ah, "Recombinant DNA", 
Scientific American Books, New York; Birren et ah (eds) "Genome Analysis: A 
Laboratory Manual Series", Vols. 1-4, Cold Spring Harbor Laboratory Press, New 
York (1998); methodologies as set forth in U.S. Pat. Nos. 4,666,828; 4,683,202; 
4,801,531; 5,192,659 and 5,272,057; "Cell Biology: A Laboratory Handbook", 

25 Volumes I-III Cellis, J. E., ed. (1994); "Current Protocols in Immunology" Volumes 
I-III Coligan J. E., ed. (1994); Stites et al. (eds), "Basic and Clinical Immunology" 
(8th Edition), Appleton & Lange, Norwalk, CT (1994); Mishell and Sbiigi (eds), 
"Selected Methods in Cellular Immunology", W. H. Freeman and Co., New York 
(1980); available immunoassays are extensively described in the patent and scientific 

30 literature, see, for example, U.S. Pat. Nos. 3,791,932; 3,839,153; 3,850,752; 
3,850,578; 3,853,987; 3,867,517; 3,879,262; 3,901,654; 3,935,074; 3,984,533; 
3,996,345; 4,034,074; 4,098,876; 4,879,219; 5,011,771 and 5,281,521; 
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"Oligonucleotide Synthesis" Gait, M. J., ed. (1984); "Nucleic Acid Hybridization" 
Hames, B. D., and Higgins S. J., eds. (1985); 'Transcription and Translation" Hames, 
B. D., and Higgins S. J., eds. (1984); "Animal Cell Culture" Freshney, R. L, ed. 
(1986); "Immobilized Cells and Enzymes" IRL Press, (1986); "A Practical Guide to 

5 Molecular Cloning" Perbal, B., (1984) and "Methods in Enzymology" Vol. 1-317, 
Academic Press; "PCR Protocols: A Guide To Methods And Applications", 
Academic Press, San Diego, CA (1990); Marshak et aL, "Strategies for Protein 
Purification and Characterization - A Laboratory Course Manual" CSHL Press 
(1996); all of which are incorporated by reference as if fully set forth herein. Other 

10 general references are provided throughout this document. The procedures therein are 
believed to be well known in the art and are provided for the convenience of the 
reader. 

Unless otherwise defined, all technical and scientific terms used herein have 
the same meaning as commonly understood by one of ordinary skill in the art to 
15 which this invention belongs. Although methods and materials similar or equivalent 
to those described herein can be used in the practice or testing of the present 
invention, suitable methods and materials are described below. 

EXAMPLE 1 

Bacterial-intein like (BIL) domains: novel auto-cleaving/auto-splicing protein 

domains 

Autoprocessing polypeptides capable of auto-cleavage have been shown to be 
uniquely useful in a wide range of protein engineering applications, for example for 
protein purification without the requirement for proteases. However, all prior art 
autoprocessing polypeptides suffer from various drawbacks, including suboptimal 
activity, stability, solubility, and requirement for auxiliary molecules causing 
undesirable protein modifications. In order to enlarge and enhance the current 
repertoire of autoprocessing polypeptides, the present inventors have identified, 
generated and demonstrated the functionality of novel autoprocessing polypeptides, as 
follows. 

Materials and Methods: 

In order to identify novel auto-cleaving/-splicing proteins, databases storing 
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genomic sequences of various organisms, including bacterial pathogens were searched 
for open reading frames (ORFs) coding for protein sequences containing Hint 
domains. Following identification of protein sequences containing Hint domains, 
such protein sequences were cloned and tested for auto-cleaving/-splicing activity, as 

5 described below. 

Data sources: BILs were identified in bacterial genomes by searching the 
following databases: National Center for Biotechnology Information (NCBI) sequence 
databases for Brucella melitensis (B. melitensis) 16M, Streptomyces coelicolor (S. 
coelicolor) A3 (2), Neisseria meningitidis {K meningitidis) MC58, N. meningitidis 

10 Z2491, Pseudomonas fluorescens (P. fluorescens) PfO-1, Leptospira interrogans (L. 
interrogans) 56601, Streptomyces avermitilis (S. avemiitilis) MA-468, Pirellula 
species 1, and Chromobacterium violaceum (C. violaceum) ATCC 12472 sequences; 
Integrated Genomics (http://www.integratedgenomics.com) sequence databases for 
Rhodobacter capsulatus (R. capsulatus) SB 1003 genomic sequences (Haselkom et al, 

15 2001. Photosynthesis Research 70, 43-52) and Methylobacterium extorquens (M 
extorquens) AMI,; Joint Genome Institute (http://www.jgi.doe.gov) sequence 
databases for Rhodobacter sphaeroides (R. sphaeroides) 2.4.1 (Mackenzie et al 9 
2001. Proc Natl Acad Sci USA. 99, 2275-2280), Magnetospirillum magnetotacticum 
(M magnetotacticum) MS-1, Clostridium thermocellum (C. thermocellum) ATCC 

20 27405, and Thermobifida fusca (T. fused) YX genomic sequences; The Institute for 
Genomic Research (http://www.tigr.org) sequence databases for Pseudomonas 
. syringae (P. syringae) DC3000 (Fouts et al, 2002. Proc Natl Acad Sci USA 99, 
2275-2280), Silicibacter pomeroyi (S. pomeroyi) DSS-3, Gemmata obscuriglobus (G. 
obscuriglobus) UQM 2246, Myxococcus xanthus (M xanthus) DK1622, and 

25 Verrucomicrobium spinosum (K spinosum) DSM 4136 sequences; The Sanger 
Institute (http://www.sanger.ac.uk) sequence databases for Neisseria meningitidis (N 
meningitidis) FAM18, P. fluorescens SBW25, and Rhizobium leguminosarum (R. 
leguminosarum) bv. viciae 3841 sequences; University of Oklahoma, Advanced 
Center for Genome Technology (http://www.genome.ou.edu) sequence databases for 

30 Neisseria gonorrhoeae (N gonorrhoeae) FA1 090 genomic sequences; Baylor College 
of Medicine Human Genome Sequencing Center (http://www.hgsc.bcm.tmc.edu) 
sequence databases for Mannheimia haemolytica (M haemolytica) PHL213 genomic 
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sequences; and Kazusa DNA Research Institute database (Japan; 
http://www.kazusa.or.jp) for Gloeobacter violaceus (G. violaceus) PCC 7421 
sequences. 

BIL domain nomenclature: BEL domains were named using the format "a_b" 
5 where "b" is an abbreviation of the bacterial species of origin and where "a" is either 
host protein name (e.g., "FhaB" or "MafB"); an arbitrary BIL# designation; an 
Integratedgenomics Database (http://ergo.integratedgenomics.com/R_capsulatus.html) 
number for R. capsulatus; a Computational Biology Program at ORNL 
(http://genome.oml.gov/microbial/rsph) analysis code for R. sphaeroides; or a gene 

10 number for Bjnelitensis strain 16M and K meningitidis strains MC58 and Z249L 
Available gene identifier accession numbers and further information relevant to 
identified BEL domains are provided in Table 1, below. 

Computational sequence analysis: The BLAST software package of the 
NCBI was used for sequence-to-sequence searches (Altschul, SF. et al. 9 1997. Nucleic 

15 Acids Res. 25, 3389) of BIL domains with BIL domains and with intein sequences, 
and the BLIMPS software was employed for block-to-sequence searches (Henikoff S. 
et aL, 1995. Gene 163, GC17). Multiple block sequence alignments were constructed 
using BLOCKMAKER (Henikoff S. et aL, 1995. Gene 163, GC17) and MACAW 
(Schuler GD. et aL, 1991. Structure, Function and Genetics 9, 180) software, as 

20 previously described (Pietrokovski S., 1998. Protein Science 7, 64). BIL domains 
were aligned with other BIL domains having higher scores than with intein sequences 
and alignments of BIL domains with each other was across their whole, or almost 
whole, lengths (results not shown). This is also a practical way to classify BIL 
domains as such. Phylogenetic analysis was performed using the PHYLIP software 

25 package (Felsenstein J., 1989. Cladistics 5, 164) version 3.55. 

Generation of BIL domain phytogeny dendrograms: BIL domain phylogeny 
dendrograms were computed from DNA multiple sequence alignment of 49 mostly 
complete BIL domains aligned across 201 positions, coding for 67 amino acids which 
could be confidently aligned across BIL domains. Nodes with bootstrap values below 

30 440/1000 were collapsed, and bootstrap values above 800/1000 are shown. Bootstrap 
values of the nodes grouping all A-type and B-type BILs are 441 and 519, 
respectively. The D. melanogaster Hedgehog Hint domain (Porter JA., et aL, 1996. 
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Cell 86, 21) was used as an outgroup to root the tree. The dendrogram was calculated 
using the DNADIST program (version 3.5) of the PHYLIP software package 
(Felsenstein J., 1989. Cladistics 5, 164). Results were verified against those obtained 
using CLUSTALW software (Thompson JD. et al, 1994. Nucl Acid Res. 22, 4673) 
5 and from protein multiple sequence alignments obtained using PHYLIP, PROTDIST, 
and CLUSTALW software. 

BIL functional activity assays: 

In order to analyze the capacity of BIL domains to auto-cleave/auto-splice 
flanking sequences, genetic sequences encoding BIL domains and portions of flanking 

10 sequences were cloned for expression as chimeric proteins tagged at their amino 
terminal ends with the malE gene-encoded maltose-binding protein (MBP) affinity 
tag, and at the carboxy terminal end with the B. circulans cbd gene-encoded chitin- 
binding domain (CBD) affinity tag. These chimeras were expressed in an in-vitro 
transcription/translation system, or overexpressed in-vivo in E. coli, and resulting 

15 protein products were analyzed for evidence of BIL domain-mediated autoprocessing 
activity. 

Such chimeras were cloned using BIL domain genetic sequences encoding: 
(i) the Type A BIL domain FhaBjpsesy (Table 1, Figure la) and its 
downstream-flanking threonine residue to generate the chimera "MBP-PsyBIL-CBD"; 



(ii) the Type B BIL domain BIL2_rhosp (Table 1, Figure lb) including 32 
amino terminal-flanking and 1 1 carboxy terminal-flanking amino acids to generate the 
chimera "MBP-RspBIL2-CBD". 

Constructs: B. circulans cbd gene sequences encoding CBD were cloned by 

25 PCR from expression vector pTYB2 (New England Biolabs, Beverly, MA) using the 
primers 5 -AAATGTCGACTGCGGTGGCCTG ACC-3 ' (SEQ ID NO: 1) and 5'- 
TGTCGT ATTGCTTCCTTTC G GGCTT-3 ' (SEQ ID NO: 2), and inserted, including 
the upstream linker 5 '-TGCGGTGGC CTG ACCGGTCTG AACTC AGGCCTC-3 ' 
(SEQ ID NO: 3), into the Sall/PstI linearized, isopropyl beta-D-thiogalactoside 

30 (IPTG)-inducible MBP-fusion protein expression construct pMALC2 to generate the 
MBP-CBD fusion protein expression construct pC2C. Construct pC2C was used in 
functional assays as a positive control for expression of MBP-CBD. 



20 



and 
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For MBP-PsyBIL-CBD expression, genetic sequences encoding PsyBIL and 
flanking sequences were PCR amplified from P. syringae DC3000 strain genomic 
DNA (kindly provided by Dr. G. Sessa, Tel- Aviv University) using the primers 5'- 
AAAAGGATCCTGCTTTGCGGCCGGAACGA-3 ' (SEQ ID NO: 4) and 5'- 
5 AAAATCT AG AGGTATT ATGCACCC ATGTCTTG-3 ' (SEQ ID NO: 5), and cloned 
in BamHI/5£baI linearized pC2C, between the malE MBP-encoding sequences and the 
CBD expressing cbd sequences to generate the MBP-PsyBIL-CBD expression 
construct pC2C-PsyBIL. 



10 and flanking sequences were PCR-amplified from R. sphaeroides 2.4.1 strain genomic 
DNA (supplied by Dr. Steven L. Porter, Department of Biochemistry, University of 
Oxford) using the primers 5 '-G AATTCGGTG ATTC ATCCTTGGGGCG A-3 ' (SEQ 
ID NO: 6) and 5 '-TCTAGAA AAAC ACGGC AAGGGCGAGCGG-3 ' (SEQ ID NO: 
7), and cloned in EcoRI/Xbal linearized pC2C, between the MBP-encoding malE 

15 sequences and the CBD-encoding cbd sequences to generate the MBP-RspBIL2-CBD 
expression construct pC2C-RspBIL2. 

the MBP-BIL4cloth-CBD expression construct pC2C-BIL4cloth. 

Polymerase chain reactions were performed using a Biometra thermal cycler in 
a 50 jlxI reaction mixture containing Taq polymerase buffer (Sigma, St. Louis, MI), 1 
20 pi Taq DNA polymerase, 200 raM dNTP, 10 mM of each primer and 100 ng genomic 
DNA 

The chimeras were constructed such that carboxy terminal or amino terminal 
cleavage thereof was expected to generate [MBP-BIL + CBD] or [MBP + BIL-CBD] 
specific protein products, respectively. The MBP- or CBD-containing protein 

25 products were expected to vary in size according to the size of the BIL domain 
flanking sequences included in the BIL sequences cloned in the chimeric proteins. 
Auto-splicing by the chimeras was expected to generate MBP-CBD protein products 
having a moleeular weight varying according to the size of the BEL domain flanking 
sequences included in the cloned BIL segment. 

30 Chimeras were expressed and analyzed for evidence of BIL processing 

activity, as described below. 



For MBP-RspBIL2-CBD expression, genetic sequences encoding RspBIL2 
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In-vitro BIL protein expression and activity assays: In-vitro transcription- 
translation of MgP-BIL-CBD and MBP-CBD chimeric proteins was achieved using 
chimera expression constructs pC2C-RspBIL2 and pC2C-RspBEL2a, and expression 
construct pC2C as DNA templates, respectively, using E. coli S30 extract for circular 

5 DNA system (Promega Kit #L1030, Promega, Madison WI). Reactions were carried 
out according to the manufacturer's instructions, using a reaction containing 0.25 mM 
[ S]-methionine, 220 nmol of expression construct DNA as template, 1-2 mM 
dithiothreitol, and having a pH of about 8.0. Reactions were incubated at 37 °C for 
90-120 minutes. Prior to electrophoresis of expressed protein, 5 jal or 10 fil aliquots 

10 of reaction mixtures were mixed with four volumes of acetone in order to remove 
polyethylene glycol. Acetone precipitation was followed by centrifugation at 12,000 
x g for 5 minutes. The supernatant was discarded and the protein-containing pellet 
was mixed with gel loading buffer to give a final concentration of 0.06 M Tris-Cl, 2 % 
SDS, 10 % (v/v) glycerol, and 0.01 % brbmophenol blue. Proteins were separated via 

15 7.5 % or 10 % SDS-PAGE, and the separated proteins were visualized using a 
phosphor-imaging screen. Phosphor-imaging signals were quantified using N1H 
IMAGE 1.62 software. Product quantities were derived from values of three 
independent experiments averaged for each sample together with their standard 
deviation of the means. The molar percentage of each product was calculated. 

20 The expected molecular weights of MBP-RspBIL2-CBD, and of its splicing 

product MBP-CBD, its carboxy terminal cleavage product MBP-RspBIL2, and its 
MBP-containing amino terminal cleavage product are 68.0, 55.1, 60.5 and 46.7 kDa, 
respectively. 

The expected molecular weights : of control vector pC2C-expressed MBP-CBD 
25 and of its MBP portion were 50.3 and 43.0 kDa, respectively. 

In-vivo Type A BIL domain protein expression, purification and activity 
assay: Competent TBI E. coli cells (NEB, Beverly, MA) were transformed with 
constructs for expression of MBP-BIL-CBD chimeras. Transformants were plated on 
LB agar supplemented with ampicillin (100 jig/ml). Single colonies were used to 
30 inoculate 3 ml aliquots of LB medium supplemented with ampicillin (100 jig/ml). 
Following incubation at 37 °C for 16 hours with shaking, 1 ml of culture was used to 
inoculate a 2 liter flask containing 500 ml of LB supplemented with ampicillin (100 
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Hg/ml). Incubation was continued at 37 °C with shaking until the optical density at 
600 nm was 0.6, at which point IPTG was added to a final concentration of 0.3 mM. 
After further incubation for 3 hours, cells were harvested by centrifugation at 5,000 x 
g for 20 minutes, re-suspended in solution containing 20 mM Tris (pH 7.4), 200 mM 

5 NaCl, and protease inhibitor cocktail (Sigma, St. Louis, MI), and lysed by sonication. 
Lysates were centrifuged at 17,000 x g for 20 minutes to remove cell debris, and 
supernatants were harvested for subsequent analyses. Proteins . were then affinity 
purified with either chitin (NEB, Beverly, MA) or amylose beads (NEB, Beverly, 
MA) which bind to the CBD or MBP affinity tags included in the chimeric protein. 

10 Elution of protein from beads prior to electrophoresis was performed by mixing the 
protein-bound beads with SDS-PAGE sample loading buffer. 

Western immunoblotting assays and protein staining: Products generated by 
expression of MBP-BIL-CBD chimeras were separated by SDS-PAGE. Briefly, 
protein samples were mixed with protein loading buffer to give a final concentration 

15 of 0.06 M Tris-Cl, 2 % SDS, 10 % (v/v) glycerol, 0.1 M dithiothreitol, and 0.01 % 
bromophenol blue. All samples were boiled for 3 minutes prior to electrophoresis. 
Separated proteins were analyzed by Western immunoblotting using either 
monoclonal mouse anti-MBP (Novus Biologicals, Inc. Littleton, CO) antibody for 
identification of the MBP tag, or polyclonal rabbit anti-CBD (NEB, Beverly, MA) for 

20 identification of the CBD tag. Secondary antibodies used were HRP conjugated goat 
anti-mouse IgG or goat anti-rabbit IgG (Jackson ImmunoResearch Laboratories Inc., 
West Grove, PA). Relative apparent molecular weights were calculated using 
TriChromoRanger (Pierce, Rockford IL) prestained markers. 

Electrophoretic gels containing separated proteins were fixed in 40 % 

25 methanol/7 % acetic acid and stained with PhastGel Blue R stain (Pharmacia Biotech 
AB, Sweden). Gels were destained in 40 % methanol/7 % acetic acid and then in 
deionized water, and visualized protein bands were excised and electroeluted for 
MALDI mass spectroscopy (MS) analysis. 

Etectroelution of protein: Protein was electroeluted from gels at 150 volts for 

30 2 hours in GeBAflex tubes (Gene Bio Application Ltd., Israel) using elution buffer 
containing 0.025 % SDS, Tris and Tricine (pH 8.5). Following electrocution SDS 
was removed from electroeluted protein using cold TCA: acetone precipitation in the 
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presence of 0.5 % sodium deoxycholate (NaDOC; T. Mehlman and A. Shainskaya, 
unpublished). 

In-gel proteolysis: Protein bands from PhastGel Blue R stained gels were 
destained using multiple washes in 50 % acetonitrile in 50 mM ammonium 

5 bicarbonate. Destained protein bands were subsequently reduced, alkylated and in-gel 
proteolysed using either bovine trypsin (sequencing grade, Roche Diagnostics, 
Germany) or chymotrypsin, (Boehringer Mannheim, Germany) by incubation with 
12.5 ng/|il protease in 50 mM ammonium bicarbonate at 37 °C, as previously 
described (Shevchenko et al. 9 1996. Analytical Chemistry 68, 850). Extracted peptide 

10 solutions were dried for subsequent MALDI-MS analysis. 

Mass Spectrometry: Intact molecular mass measurement and peptide mass 
mapping were performed using a Bruker Reflex III MALDI time-of-flight (TOF) 
mass spectrometer (Bruker, Bremen, Germany) equipped with SCOUT source, 
delayed ion extraction, reflector and a 337 nm nitrogen laser. Each mass spectrum 

15 was generated using data accumulated from 200 laser shots. Both external and nearby 
calibrations for proteins were performed using BSA and myoglobin (Sigma). For 
peptide mapping, internal calibration with molecular ions of regularly occurring 
matrix ions and peptides derived from trypsin was additionally performed to 
consolidate further peptide assignment. 

20 Intact molecular weight measurements by MALDI MS: Gel electroeluted 

proteins were further purified by cold acetone precipitation. The dried extract from 
one lane of the gel was re-dissolved in 0.5 ml of 80 % formic acid and immediately 
diluted with water to yield a solution containing 20 % formic acid, and 50 % of this 
solution was applied to a target plate. 

25 Peptide mass mapping by MALDI mass spectrometry: Aliquots of one tenth 

of the extracted peptide mixture volume, dissolved in 0.1 % TFA or formic 

« 

acid/isopropanol/water (1:3:2), were used for MALDI-MS using the fast evaporation 
or dry droplet method. Matrix surfaces of a-cyano-4-hydroxycinnamic acid (4- 
HCCA) or 2,5-dihydroxybenzoic acid (DHB) were utilized for the fast evaporation 
30 (Jensen ON. et al, 1996. Rapid Commun in Mass Spectrom. 10, 1371; Vorm O. et ah, 
1994. Analyt Chem. 66, 3281) or dry droplet method (Kussmann K. et al, 1997. J 
Mass Spectrom. 32, 593), respectively. 
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Experimental Results: 

Identification of two novel bacterial intein-like domains containing Hint-like 
motifs: Searches of sequence databases of diverse bacterial species for Hint-like 
motif-containing putative ORFs identified open reading frames coding for proteins 
5 comprising two related types of novel intein-like protein domains termed by the 
present inventors Type A and Type B bacterial intein-like (BIL) domains. Novel type 
A and B BIL domain sequences identified are shown aligned in Figures la (SEQ ID 
NOs: 8-62) and lb (SEQ ID NOs: 63-104), respectively. The bioinformatic sources 
used to identify these BIL domains are shown in Table 1 . 

10 An amino acid sequence motif (SEQ ID NO: 105) was identified (Figure 2a) 

which exclusively defines a subset of Type A BIL domains, including domains: 
39_9_thefus, SCP1 .20 l_strco, 3875_87_magma, B0372+_neimeB, B0655+_neimeB, 
A2115_neime, MafB l_neimeC, BIL2_neimeC, BIL4_neimeC, BIL5_neimeC, 
BIL6_neimeC, MafBl_neigo, BIL2_neigo, BIL3_neigo, MafB2_neigo, BIL5_neigo, 

15 BIL6_neigo, FhaB_psesy, FhaBjtnanha, FhaBl_psefl-PfO-l, FhaBljpsefl-SBW25, 
BIL6_cloth, BIL5_cloth, BIL2_cloth, BIL4_cloth, BILl_cloth, BIL8_cloth, 
BIL9_cloth, BILl_gemob, BIL2_gemob, 0709_lepin, 3725_lepin, ol078_myxxa, 
ol070_myxxa, BILl_strav, BIL2_strav, BIL3_strav, BILl_pirsp, BILl_chrvi, 
o3395_versp, o5687_versp, o649_versp, BILl^glovi, BEL2_glovi, BEL3_glovi, and 

20 BIL4_glovi. 

An amino acid sequence motif (SEQ ID NO: 106) was identified (Figure 2b) 
which exclusively defines a subset of Type B BIL domains, including: 4825_rhosp, 
BIL2_rhosp, 00588__rhoca, 02710_rhoca, 01524_rhoca, 01523_rhoca, 00126_rhoca, 
01216_rhoca, 00949_rhoca, 01374_rhoca, 00459_rhoca, 00460_rhoca, 00746_rhoca, 
25 03530_rhoca, 00199_rhoca, BIL3_magma, BIL4_magma, BILl_brusu, 

■ 

BILl^unknwn, BIL2_unknwn, 06786_metex, BILl_silpo, BIL2_silpo, BIL3_silpo, 
BIL4_silpo, BIL5_silpo, BIL6_silpo, BIL7_silpo, BIL8_silpo, BIL9_silpo, 
BIL10_silpo, BILll_silpo, BIL12_silpo, BIL13_silpo, BIL14_silpo, BIL15_silpo, 
BIL16_silpo, BILl_rhile,andn0519J>rume. t- " 
30 This new type of domain appears in non-conserved regions of hyper-variable 

proteins. Thus, these domains are distinct from the Hint domains of inteins and Hog- 
proteins by the species and proteins in which they appear. An analysis of amino acid 
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residue conservation within Hint-like motifs of BIL domains and within homologous 
Hint domain motifs of Hog proteins and inteins is shown in (Figures 3a-z). 
Examination of BLAST sequence alignments (Altschul, SF. et ah, 1997. Nucleic 
Acids Result. 25, 3389) of BIL domains with BIL domains and with intein sequences 
5 showed that BIL domains aligned with each other with higher scores than with intein 
sequences ^cross their whole, or almost whole, lengths (results not shown). 
Therefore, BIL domains were found to be distinct from inteins by their global 
sequence features. 



1 0 Table L Databases used to identify BIL domains. 



BIL 
Typ 
e 


BIL domain 
name 


Source** 


Date*** 


Cohtig/Entry 


Coordinates 


A 


BILl_cloth 


NCBI 




23022619 




A 


BIL2_cloth 


NCBI 




28N'aa+23020813+ 
59aa+23020812 


- 


A 


BIL3_cloth 


NCBI 




23022239+1 4N'aa 




A 


BIL4_cloth 


NCBI 




230208 17+5N'amin 
oacid/gi|23020817 


311-445 


A 


BIL5 cloth 


NCBI 




2302081 5+1 3N'aa 


- 


A 


BIL6_cIoth 


NCBI 




23022237 


- 


A 


BIL7_cloth 


NCBI 




23022587+7N'aa 


- 


A 

A 


BIL9_cloth 


NCBI 

r 




23022893+59aa+ 
23022892 




A 


BILlO^cloth 


NCBI 




22262017 


34594-34986 


A 


BIL1 l_doth 


NCBI 




22262176 


416-728 


A 


3 875_87_magma 


NCBI 




21614488 


76532-75165 


A 


FhaB manha 


BCM 


4Oct01 


C78-C85 


11046-20977 


A 


BIL2_neigo 

i 


OU- 
ACGT 


26Sep00 


AE004969 


1563413-1564129 : 


A 


BIL3_neigo 


OU- 
ACGT 


26Sep00 


AE004969 


1565033-1565809 


A . 


BIL5_neigo ; 


OU- 
ACGT 


26Sep00 


AE004969 


1351509-1350766 . 

< 


A 


BIL6_neigo 


OU- 
ACGT 


26Sep00 

> 


AE004969 

i 

■ 


1349978-1349310 ; 


A 


MafBl_neigo 


OU- 
ACGT 


26Sep00 


AE004969 


1560214-1561941 ! 


A 


MafB2_neigo 


oy— 

ACGT 


26Sep00 


AE004969 


1355876-1354062 

• > 
* - 1 


A 


B0369+ neimeB 


NCBI 


• 


7225591 +34 N'aa 1 




a ; 


B0372+_neimeB 


NCBI 




7225594+11 N'aa 




A 


B0655+_neimeB ; 


NCBI 




7225882 +14 N' aa 




A 


BIL2_neimeC 


Sanger 


15May02 


NmC 


1836857-1837573 ; 
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A 


BIL3_neimeC 


Sanger 


15May02 


NmC 


1838418-1838981 


A 


BIL4_neimeC 


Sanger 


15May02 


NmC 


1839771-1840439 


A 


BIL5_neimeC 


Sanger 


1 5May02 


NmC 


627204-627920 


A 


BDL6_neimeC 


Sanger 


15May02 


NmC 


628395-629102 


A 


MafB l_neimeC ; 


Sanger 


15May02 


NmC 


1833717-1835480 


A 


FhaBl_psefl- 
PfO-1 


NCBI- 
TIGR 




205922-575 


3-6313 


A 


FhaBl_psefl- 
SBW25 


Sanger 


2Sep02 


Pflu552a0l 


41728-29387 


A 

A 


FhaB__psesy 


TIGR 


30Aug02 


5668 


5148986-5149429 


A 

A 


SCPL201_strco 


NCBI 




13620683 +32 N' aa 




A 

A 


39_9_thefus 


JGI 


INovOO 


39 


13655-15508 


A 


BIL1 gemob 


TIGR 


23Sep02 


14 


32-20392 


A 

A 


BIL2_gemob 


TIGR 


14Feb03 


354 


16692-11734 


A 


0709_lepin 


NCBI 


10Nov02 


24213409 


- 


A 


3725_Jepin 


NCBI 


10Nov02 


24216424 


- 


A 


3719Jepin 


NCBI 


10Nov02 


24197710 


- 


A 

A 


o665_myxxa 


TIGR 


23Apr03 


168 


495-3704 


A 


ol078_myxxa 


TIGR 


23Apr03 


157 


103477-106710 


A 


i /■> #^ 

ol070_myxxa 


TIGR 


23Apr03 


168 


495-3704 


A 


BILl_strav 


NCBI 


20Jul03 


29826740 


- 


A 


BIL2_strav 


NCBI 


20Jul03 


29826826 


- 


A 


BIL3__strav 


NCBI 


20Jul03 


29831835 


- 


A 


BILl_pirsp 


NCBI 


20Jul03 


32470666 


- 


A 


BILl_chrvi 


NCBI 


07Sep03 


34104178 


- 


A 


BILl_glovi 


Kazusa 


04Sep03 


gll0211 


- 


A 


BIL2_glovi 


Kazusa 


04Sep03 


gll0207 


- 


A 


BIL3_glovi 


Kazusa 


04Sep03 


g!102I3 


- 


A 

A 


BIL4_glovi 


Kazusa 


04Sep03 


g!10212 


- 


A 

A 


BIL5_glovi 


Kazusa 


04Sep03 


gll0205 


- 


A 

A 


BIL6_gIovi 


Kazusa 


04Sep03 


gl!0208 


i 


A 


BIL7_glovi 


Kazusa 


04Sep03 : 


gs!3615 


- 


A 


o649_versp 


TIGR 


02Sep03 


65738 


3-1949 


A 

A ; 


o5687_versp 


TIGR 


02Sep03 


65921 


18348-1288 


A 

A 


o3395_versp 


TIGR 


02Sep03 


65925 


56853-46669 


B 


1105 1 9_brume 1! 


NCBI 




17988864 




B 


BIL2_magma 


NCBI 




21613062 


922-1590 


B 


BIL3_magma 


NCBI 




21614112 


2449-1475 


B 


TT jt 

BIL4_magma 


NCBI 


. 


21614173 


2216-3187 


B 


BILSmagma 


NCBI 




21612572 


3-338 


B 


BIL6^magma 


NCBI 


• 


21613847 


2033-1774 


B 


06786_metex : 


IG 


Jun02 


1507 


6076-7113 


£> 


uu i zo rnoca 


ICj 


DecOl ' 


2G06-2D11 


114767-113670 


B 


00199 rhoca 


IG 


DecOl 


2G06-2D11 


178648-177806 


B 


00459_rhoca. 


IG 


DecOl 


2G06-2D1 1 


434469-435083 


B 


00460 rhoca 


IG 


DecOl 


2G06-2D11 


435094-436191 


B 


00746 rhoca 


IG 


DecOl 


2D10-2D06 


2243-3079 


B 


00949_rhoca 


IG 


DecOl 


2A12-2D05 


325707-326651 
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B 


01216_rhoca 


IG 


DecOl 


2A12-2D05 


222590-223555 


B 


01374 rhoca 


IG 


DecOl 


2A12-2D05 


148470-149462 


B 


01523_rhoca 


IG 


DecOl 


1A01-1C09 


279638-280444 


B 


01524jrhoca 


IG 


DecOl 


1A01-1C09 


280569-281423 


B 


02710_rhoca 


IG 


DecOl 


1D09-1F02 


197288-199588 


B 


03530_rhoca 


IG 


DecOl 


1A01-1C09 


521700-521173 


B 


4825__rhosp 


JGI 


26Mar01 


184 


67785-67165 


B 


BIL2_rhosp 


JGI 


26Mar01 


177 


9673-10194 


B 


BILl_silpo 


TIGR 


18Jun02 


50 

4 


10679-14440 


B 


BIL2_silpo 


TIGR 


1 8 Jun02 


50 


1-2688 


B 


BIL3_silpo 


TIGR 


1 8 Jun02 


4 


10260-12356 


B 


BIL4_silpo 


TIGR 


18Jun02 


290 


18579-19640 


B 


BIL5_silpo 


TIGR 


18Jun02 


11 


3224-2217 


B 


BIL6_snpo 


TIGR 


18Jun02 


199 


13687-15303 


B 


BIL7_silpo 


TIGR 


18Jun02 


199 


15104-16399 


B 


BIL8_silpo 


TIGR 


1 8 Jun02 


32 


1329-2600 


B 


BIL9__silpo 


TIGR 


1 8 Jun02 


32 


28672-26588 


B 


BIL10_siIpo 


TIGR 


1 8 Jun02 


60 


19338-18265 


B 


BILll_silpo 


TIGR 


1 8 Jun02 


89 


10027-11181 


B 


BIL12_silpo 


TIGR 


18Jun02 


126 


9020-10066 


B 


BIL13_silpo 


TIGR 


18Jun02 


110 


13640-14440 


B 


BIL14_siIpo 


TIGR 


18Jun02 


125 


9769-10353 


B 


BlL15_silpo 


TIGR 


18Jun02 


129 


7727-8257 


B 


BIL16_silpo 


TIGR 


1 8 Jun02 


195 


7271-8050 


B 


Bill_rhile 


Sanger 


14Jul03 


RHIZ10E3Cbl2,sl 
k 




B 


BIL1 unknwn 


TIGR 


13Mar01 


14712 


1124-51 


B 
» 


BIL2_unknwn 


TIGR 


13Mar01 


12703 


2-1369 



* o — 'vwuvnivu kjj ujv uaviuiai opt^lto WUO -LUilUWlllg U1C 

underscore in the BIL name (refer to Table 2 below for the code-species correspondance). 

The sources are named as follows JGI - Joint Genome Institute (http://www.jgi.doe.gov), NCBI - 
(http://www,ncbi.nlm.nih.gov), Sanger - The Sanger Institute (http://www.sanger.ac.uk), OU-ACGT - 
University of Oklahoma, Advanced Center for Genome Technology (http://www.genome.ou.edu), 
TIGR - The Institute for Genomic Research (http://www.tigr.org), BCM -r Baylor College of Medicine 
Human Genome Sequencing Center (http://www.hgsc.bcm.tmc.edu), IG - IntegratedGenomics 
(http://www.integratedgenomics.com), and Kazusa - Kazusa DNA Research Institute database (Japan; 
http://www.kazusa.or.jp). 
***Dates refer to the data release dates used. 

The NCBI entries were extended as noted. Coordinates of the BIL host protein ORF are given for 
nucleotide contigs/entries. The positions of BILs within these ORFs are provided in Figures la-b. 
^An identical sequence was identified in Brucella suis 

Table 1 cont 



Conserved motifs in BIL domains, Hog proteinslmd inteins: Alignments of 
conserved motifs in Type A BIL domains, Type B BIL domains, Hog proteins, and 
inteins are shown in Figures 3a-g, 3h-o, 3p-t, and 3u-z, respectively. Type A BIL 
domains were found to share 7 Hint-like consensus sequence motifs (Figures 3a-d and 
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3f-g) and one novel motif (Figure 3e) having no known Hog or intein counterpart. 

Almost all Type A BIL domains were found to comprise apparent functional 
protein splicing active sites corresponding to those present in inteins (marked by 
asterisks in Figures 3u, 3w and 3z), and several are also flanked at their carboxy 
5 terminal ends with serine or threonine amino acid residues, similarly to the carboxy 
terminal ends of inteins. 

* 

Type A BIL domains were found to contain an invariant His-Asn amino acid 
residue pair adjacent to the carboxy terminal end thereof (Figure 3g), similarly to the 
His-Asn amino acid residue pair typically forming the carboxy terminal ends of 

10 inteins (Figure 3z, positions 7-8). Conservation of the Type A BIL domain carboxy 
terminal end with that of inteins suggests that, similarly to inteins, Type A BILs 
undergo cyclization of the Asn residue forming the carboxy terminal end thereof. 
However, the residue at the carboxy terminal end of Type A BIL domains (Figure 3g, 
position 8), which corresponds to the residue forming the amino terminal end of 

15 polypeptide segments flanking the carboxy terminal ends of inteins which is always 
Cys, Ser or Thr (Figure 3z, position 9), is not conserved, since only a few Type A BIL 
domains have a serine or threonine residue in that position. Other Type A BIL 
domains have aspartate, glutamate, asparagine, tyrosine or alanine residues in that 
position, which are not found in any intein. 

20 Type B BIL domains were also found to share 6 Hint-like consensus sequence 

motifs (Figures 3h-i and 31-o) and two novel motifs (Figures 3j-k) having no known 
Hog protein or intein Hint domain counterpart. Type B BIL domain carboxy terminal 
ends were found to have a conserved position comprising Cys, Ser or Thr residues 
(Figure 3o, position 6), potentially corresponding to the carboxy terminal flanking 

25 position of inteins (Figure 3z, position 9), however this carboxy terminal residue of 
Type B BIL domains (Figure 3o, position 7) is not preceded by the His-Asn motif 
typically found in inteins (Figure 3z, positions 7-8) and in Type A BIL domains 
(Figure 3g, positions 6-7). The -SH/-OH groups on the side chains of the 
aforementioned Cys, Ser or Thr residues in intein host proteins have been found to be 

30 essential for ligation of the intein carboxy and amino flanks in the protein splicing 
reaction (Xu MQ. and Perler FB., 1996. EMBO J. 15, 5146). 

In Type A BIL domains whose carboxy terminal ends are not flanked by Thr 
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or Ser residues, Asn cyclization may nevertheless occur without trans-esterification by 
the flanking residue. Alternately, trans-esterification may occur by the mildly 
nucleophilic residues found in this position. In the first case the BIL domain would be 
cleaved from a segment flanking its carboxy terminal end, and in the second case 

m 

5 protein splicing would occur. Since Type B BIL domains do not have any conserved 
' Asn or Gin residue at their carboxy terminal end, cleavage of this end could then 
proceed by a mechanism different from the Asn and Gin cyclizations of inteins 
(Paulus H., 2000. Annu Rev Biochem. 69, 447). 

Key amino acid residues corresponding to protein splicing active sites (marked 
10 by asterisks in Figures 3u, 3w, and 3z-position 9) were found to be conserved in Type 
B BIL domains. 

Both Type A and Type B BIL domains were found to be distinct from inteins 
in having additional unique sequence motifs, in not being integrated in highly 
conserved sites of essential proteins, and in not comprising endonuclease domains. 

15 Phylogenetic distribution of BIL domains: The phylogenetic distribution of 

the BIL domains identified is shown in Table 2. BIL domains were identified in 3 
evolutionarily distant bacterial types - alpha, beta and gamma proteobacteria (gram- 
negative bacteria), actinobacteria (high GC gram-positive bacteria), and 
Bacillus/Clostridium group bacteria (low GC gram-positive bacteria). 

20 Both presence and genomic distribution were found to be variable, even in 

closely related species and strains. For example, 1, 3 and 6 ORFs encoding BIL 
domains were identified, respectively, in N. meningitidis strains whose genomes have 
been completely or almost completely sequenced; 2 and 14 ORFs encoding BIL 
domains were identified in 2 different Rhodobacter species; and while one such ORF 

25 was identified in P. syringae, none were found in P. aeruginosa and P. putida. 

BIL domains and inteins were found to coexist in certain species. For 
example, the genome of M magnetotacticum was found to comprise ORFs encoding 
both Type A and Type B BIL domains, and that of T. fusca was found to comprise 
ORFs encoding both BIL domains and inteins. 

30 The variability observed in the number of BIL domain ORFs in different 

species is probably due to gene duplications. As shown in a dendrogram 
demonstrating phylogenetic relationships of Type A BIL domains (Figures 4a), all 
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BIL domains derived from Neisseria species cluster together, and BIL domains from 
different species sub-cluster as well, implying that all Neisseria BIL domains arose by 
duplication from a single ancestor and that some are paralogs within different species. 
The latter is corroborated by the apparent duplication of some gene loci containing 
5 BIL domains in these species (not shown). Clustering of BIL domains from the same 
species was also observed in C. thermocellum (Figure 4a) and M. Magnetotacticum 
(Figure 4b). 

BIL domain host proteins: BIL domains were identified in putative ORFs 
coding for a few hundred to a few thousand amino acids. Several BIL domains were 
10 found to be flanked by domains present in secreted bacterial proteins. In P. syringae 
and M. haemotytica, BIL domains were identified near the carboxy terminal end of 
FhaB-like ORFs. FhaB is a very large Bordetella gene coding for a secreted 
filamentous hemagglutinin protein, which functions as an adhesin important for B. 
pertussis virulence (Smith AM. et al, 2001. FEMS Microbiol Rev. 25, 309). Three of 

15 the R. capsulatus BIL domain-containing ORFs include RTX repeats - calcium 
binding repeats found in various secreted bacterial proteins, including many toxins 
(Coote JG., 1992. FEMS Microbiol Rev. 8, 137). In N. meningitidis and N. 
gonorrhoeae, BIL domains were identified in MafB proteins. These are part of 
multiple adhesin family possibly involved in glycolipid adhesion to cells (Naumann et 

20 al, 1999. Curr Opin Microbiol 2, 62-70; Paruchuri et al, 1990. Proc Natl Acad Sci U 
S A. 87, 333-7). Three other Neisseria BIL domains were found to have an HNH 
nuclease domain in amino acid sequences flanking their carboxy terminal ends. HNH 
domains are found in various DNase and endonuclease proteins including secreted 
toxins {Belfort M. and Roberts RJ., 1997. Nucleic Acids Res. 25, 3379; James R. et 

25 al., 1996. Microbiology 142, 1569). A domain present in the amino acid sequence 
flanking the carboxy terminal end of a BIL domain in the gram-positive bacterium T. 
Jusca is also found in a short, conserved Salmonella ORF (GenBank accession 
NP_454902) and in an amino acid sequence flanking the carboxy terminal end of a N. 
meningitidis FhaB/hemoIysin protein (gene NMA0688). Both of these proteins are 

30 from gram-negative bacteria and are likely to be secreted. 
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Table 2. Phylogenetic distribution ofBIL domains identified. 



Taxonomic group 


Species 


Species code 


BIL 
type 


No of BIL 
domains 
identified 
in 

organism 


ai^jiia. L/icuual< IC1 la 


JvnuuOuuCier CQpSUlCLlUS odI UUJ 


rhoca 


B 


14 


alexia uiuibUUuvlvl la 


jxnououacier spnaeroiaes 


rhosp 


B 


2 


txi^jna. ^ji yj lv>vj u<*\^ id la 


OlilCluuCier pOTTlQrOyi JJoo-j 


silpo 


B 


16 


altVhfl TYmtpnV.;* pterin 


Brucella memensis i oivu ^ tsruceila suis 


brume 


B 


1 


altvha nrotpnhar>tf»r.» 

aipua piuicuuavlClla 


iviagneiospiruium magnetotacticum Mo-1 


magma 


A 
B 


i 

5 


alnha T*irrjt<*nhfif*tpri5i 


alpha proteobacteria 


Methylobacterium extorquens AMI . 


metex 


B 


r 


aipna proteobacteria 


Rnizobium leguminosarum bv. viciae 3841 


rhile 


B 


i 


beta proteobacteria 


Neisseria meningitidis Z2491 


neimeA 


A 


i 


beta proteobacteria 


Neisseria meningitidis MC58 


neimeB 


A 


3 


beta proteobacteria 


Neisseria meningitidis FAM18 


neimeC 


A 


6* 


beta proteobacteria 


Neisseria gonorrhoeae FA 1090 


neigo 


A 


6 


beta proteobacteria 


Chromobacterium violaceum ATCC 12472 


chrvi 


A 


I 


gamma 
proteobacteria 


Pseudomonas syringae DC3000 


psesy 


A 


r 


gamma 
proteobacteria 


Pseudomonas fluorescens PfO-1 


psefl-PfO-1 


A 


i* 


gamma 
proteobacteria 


Pseudomonas fluorescens PfSB W25 


psefl-SBW25 


A 


r 


gamma 
proteobacteria 


Mannheimia haemolytica PHL213 


manha 


A 


i* 


delta proteobacteria 


Myxococcus xanthus DK1622 


myxxa 


A 


3' 


spirochaetes 


Leptospira interrogans 56601 


lepin 


A 


3 


actinobacteria 


Streptomyces coelicolor A3 (2) j 


strco 


A 


1 


; actinobacteria 


Streptomyces avermitilis MA-468 


strav 


A 


3 


actinobacteria 


J- ti&rniuuijiLici jusca is\ 


tneru 


A 


1 


Bacillus/Clostridium 
group 


Clostridium thermocellum ATCC 27405 


cloth 


A 


10" 


planctomycetes 


Pirellula species 1 


pirsp 


A 


1 


planctomycetes 


Gemmata obscuriglobus UQM 2246 


gemob 


A 


2" 


cyanobacteria 


Gloeobacter violaceus PCC 7421 


glovi 


A 


1 


verrucomicrobium 


Verrucomicrobium spinosum DSM4136 


versp 


A 


3* 


unknown 


Unknown 




B 


2* 



Genome not fully sequenced — total number of BILs may be greater. 



BIL domain-mediated auto-cleavage/auto-splicing activity: 
Type A BIL domain-mediated auto-cleavage/auto-splicing activity in an in- 
vitro transcription/translation system: Electrophoretic analysis (Figure 5a) showed 
that protein products generated following in-vitro tr^iscription/translation of the 
MBP-PsyBIL-CBD expression construct pC2C-PsyBIL displayed molecular weights 
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corresponding to the uncleaved precursor MBP-PsyBIL-CBD, the splicing product 
MBP-CBD, and the carboxy terminal cleavage product MBP-PsyBIL. Two additional 
protein products displayed molecular weights of 43 and 45 kDa. Control reactions 
using the MBP-CBD expression construct pC2C as a transcription template produced 
5 two protein products, one corresponding in weight to MBP-CBD and the other to the 
MBP portion thereof. The latter may be observed in chimeric proteins having MBP as 
an amino terminal tag (refer to: NEB instruction manual "pMAL protein fusion and 
purification system", Catalog #E8000S). The 43 kDa protein product may represent a 
premature transcription or translation stop side product unrelated to BIL domain- 

10 mediated activity. Appearance of the 45 kDa band, not seen in the control reaction 
and slightly larger than the expected weight of MBP, may be due to an additional 
termination point introduced in the BIL domain. As radioactive methionine was used 
to label the reaction products, and as, unlike the MBP and BIL domains, the CBD 
domain lacks a methionine residue, its isolated product cannot be visualized according 

15 to the protocol employed. 

The relative amounts of the MBP-PsyBIL-CBD, MBP-PsyBIL and MBP-CBD 
protein products were found to be 15 %, 57 % and 28 %, respectively. 

These results therefore demonstrated the capacity of Type A BIL domains to 
auto-cleave and auto-splice flanking sequences. 

20 Type B BIL domain-mediated auto-cleavage and auto-splicing activity in an 

in-vitro transcription/translation system: Electrophoretic analysis (Figure 5b) 
showed that protein products generated following in-vitro translation of the MBP- 
RspBIL2-CBD expression construct pC2C-RspBIL2 included proteins with sizes 
corresponding to the unprocessed MBP-RspBIL2-CBD precursor, the carboxy 

25 terminal cleavage product MBP-RspBIL2, the MBP-containing amino terminal 

■ 

cleavage product, and the splicing product MBP-CBD. Also apparent was a 43 kDa 
MBP-containing fragment which also appeared in the control reaction, as described 
above. 

These results therefore demonstrated the capacity of Type B BIL domains to 
30 auto-cleave and auto-splice flanking sequences. The putative protein splicing motifs 
and catalytic residues of the FhaB__psesy amino acid sequence responsible for the 

* 

observed .autoprocessing activity are shown in Figure 5c (SEQ ID NO: 107). 



WO 2004/043997 ^^T/IL2003/000956 

50 

Recombinant BIL domains expressed in-vivo in E. coli display auto-cleavage 
and auto-splicing activity - mass spectrometric confirmation of BIL-mediated 
autoprocessing activity: In order to examine the possibility of industrially producing 
functional recombinant Type A BIL domain, pC2C-PsyBIL was overexpressed in- 

■ 

5 vivo in E. coli, and the recombinant protein products were analyzed for BIL domain- 
mediated autoprocessing activity. 

Similarly to the in-vitro results described above, purified protein from E. coli 
transformed with pC2C-PsyBIL was found to include a protein product having a 
molecular weight corresponding to the MBP-PsyBIL carboxy terminal cleavage 

10 product, as well as a protein product having a molecular weight corresponding to the 
MBP-CBD auto-splicing product, as determined by both Coomassie Blue staining and 
Western immunoblotting analysis of SDS-PAGE separated proteins (Figures 6a and 
6b, respectively). The main product was again observed to be MBP-PsyBIL protein, 
as displayed by comparing the quantity thereof produced to that of the MBP-CBD 

15 protein product when both were purified using amylose beads (Figure 6a, lane 3). 

The identities of the PsyBIL reaction products, the MBP-CBD and MBP-BIL 
protein products were also confirmed by mass spectrometry analysis (Figures 6c and 
6d, respectively). The measured mass of the MBP-CBD protein (50,602.07 Da) was 
found to be in close agreement to the expected mass of the unmodified protein 

20 (50,266.39 Da). The measured and expected masses of MBP-BIL protein were also 
found to be in close agreement: 59,332.79 and 59,070.11 Da, respectively. A 
prominent peak with a mass of 43,303 Da, was also observed in MALDI spectra of 
electroeluted 50 kDa MBP-CBD protein. Such cross-contamination can be observed 

■ 

in gel purified protein bands (A. Shainskaya, unpublished). Reactivity , of the 43 kDa 
25 band with anti MBP tag antibody (Figure 6b) indicates that this band is a truncated 
product. 

Peptide mass mapping of the 50.1 and 59.3 kDa protein products proteins by 
MALDI analysis (Tables 3 and 4, respectively) confirmed their assigned identity as 
MBP-CBD (Figure 7; SEQ ID NO: 108) and MBP-BIL (Figure 8; SEQ ID NO: 109), 
30 respectively, in particular by identifying the splice junction of the MBP-CBD protein 
(peptide position 388-396; Figure 7; Table 3) and the carboxy terminal end of the 
MBP-BIL protein (peptide position 535-541; Figure 8; Table 4) with accuracies of 27 
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and 1 00 ppm, respectively. 

Thus, following extensive experimentation, the splicing junction and cleavage 
points were found to precisely correspond to those predicted from the sequence 
similarity of the BIL and intein domains, thereby unambiguously demonstrating BIL 
5 domain autoprocessing capacity. 



Table 3. MALDI identification of peptides of the 50.1 kDa MBP-CBD splicing 

product ofMBP-PsyBIL-CBD. 



Peptide 
position 


FM+H1* 

L J 

calculated 

mass (Da) ! 


rM+Hl + 

11"! ■ A 11 

measured 
mass (Da) 


Mass 

accuracy 

(ppm) 


1-2 


278.1538 


278.1460 


28 


3-7 


563.2677 


563.2599 


13 


8-16 


1057.604 


1057.596 


7 


8-26 


2047.363 


2047.35 


6 


27-35 


1064.532 


1064.586 


50 


28-35 


936.442 


936.491 


52 


28-30 


423.2244 


423.215 


22 


90-99 


1267.647 


1267.6 


37 


129-138 


1201.522 


1201.602 


66 


129-141 


1571.732 


1571.823 


57 


191-201 


1188.642 


1188.711 


58 


172-180 


1 129.55 


1129.57 


17 


253-274 


2137.972 


2138.147 


81 


279-296 


2095.812 


2096.030 


104 


297-306 


1010.472 


1010.612 


138 


328-345 


2109.02 


2109.017 


1 


356-387 


3461.516 


3461.33 


53 


364-387 


2576.53 


2576.41 


46 


388-396 


983.48 


983.55 


71 


397-^35 


3986.34* 


3986.13 - 


52 


397-^38 


4380.80* 


4379.90 

i 


205 


439-461 


2634.93* 


2634.72 ; 


79 



mass corresponds to peptide with an alkylated cysteine residue 



10 

These results therefore fully and clearly demonstrate the capacity of BIL 
domains to auto-cleave and auto-splice flanking sequences, similarly to inteins. The 
presently described BIL domains therefore represent a novel and highly useful class of 
autoprocessing proteins which can be harnessed for manipulating and modifying 
15 proteins, for example as depicted in Figure 9. These results furthermore demonstrate 
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the suitability of utilizing genetically transformed host cells, such as E. coli, to 
industrially express chimeric proteins which comprise functional BIL domains. 



Table 4. MALDI-identification of carboxy terminal peptides of the 59.3 kDa MBP- 
5 PsyBIL carboxy terminal cleavage product of MBP-PsyBIL-CBD. 



Peptide 
position 


[M+H] + calculated mass (Da) 


[M+H] + measured mass 
(Da) 


Mass accuracy 
(ppm) 


535-541 


897.4947 


897.47 


27 


537-541 


656.3156 


656.25 


100 



Discussion: BIL domains are present in several hyper-variable bacterial 
proteins, such as FhaB adhesins and MafB proteins of Neisseria strains. Their 
immediate flanks are the most variable portions of the proteins and they themselves 
1 0 are not always present in these proteins, even in closely related strains of the same 
species. Some, and perhaps all, proteins with BIL domains seem to be secreted 
proteins. BIL domains might enhance the variability of secreted proteins by their 
protein splicing and cleavage activity as detailed below. 

As described above, the amino terminal ends of BIL domains, and of Hint 
15 domains of inteins and Hog proteins are very similar (Figures 3a-z). Thus all these 

■ 

domains probably form labile ester bonds on their amino terminal ends. In proteins 
with BIL domains these ester bonds could be attacked by various nucleophilic 
molecules, including peptides, proteins and small reactive compounds, such as 
glutathione or cysteine. Such reactions would ligate the attacking nucleophiles to a 

20 carboxy terminal position of the host protein and release the BIL domain and the host 
protein region downstream to it. This is analogous to Hedgehog protein maturation 
where the Hint domain mediates the attachment of a cholesterol molecule to the 
cleaved Hedge domain. In adhesins with BIL domains this putative ligation might 
serve to covalently attach the bacteria to its adhesion target. Additionally, released 

25 BIL and carboxy terminal domains could have a function of their own. For example, 
in pathogenic bacteria that have such proteins, the released domains could serve as 
decoys to the immune system. 

In Neisseria strains, sequences encoding BIL domains appear as either as short 

* 

open reading frames downstream of MafB genes and in the carboxy terminal ends of 
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these proteins upstream of a variable domain. This suggests that, at least in such 
Neisseria strains, BIL domains function as cassettes which can be fused to genes by 
genetic rearrangement to promote the variability of the encoded proteins. Other 

microevolutionary processes in Neisseria and Ralstonia solanacearum, a plant 

» 

5 pathogen bacterium with a wide host range, are known to generate different carboxy 
terminal ends for surface-exposed and virulence proteins (Parkhill et al, 2000. 
Nature, 404, 502-506; Salanoubat et al, 2002. Nature 415, 497-502). 

Not all species with BIL domains are pathogens and many pathogenic bacteria 
with fully sequenced genomes do not have BIL domains. BIL domains might be used 

10 in different processes not connected with pathogenicity. For example, BIL domain 
activity might be one way for bacteria to attach to diverse surfaces. 

In summary, two novel types of Hint domain-containing proteins, BIL Types 
A and B, were identified. Both types have the active site sequence features of the 
Hint domains but also possess sequence features that distinguish them from the known 

15 Hint domains and from each other. BIL domains appear in different proteins from 
diverse bacteria, including pathogenic species of humans and plants, such as Neisseria 
meningitidis and P. syringae. These domains are present in variable protein regions 
and are typically flanked by domains that also appear in secreted proteins such as 
filamentous hemagglutinin and calcium binding RTX repeats. Phylogenetic and 

20 genomic analysis of BIL domain sequences suggests that they were positively selected 
for in different lineages. Type A and Type B Bit, domains were cloned and shown to 
display auto-cleavage and auto-splicing of flanking polypeptide sequences in an in- 
vitro transcription/translation system, as well as when overexpressed in E. coli, 
thereby indicating the capacity of BIL domains to autocatalyze post-translational 

25 modifications of host proteins. 

Conclusion: The above-described experimental results demonstrate the 
capacity of the autoprocessing polypeptides of the present invention to efficiently 
auto-cleave and auto-splice flanking sequences. The presently described experimental 
results furthermore demonstrate the feasibility of utilizing genetically transformed 

30 host cells, such as E. coli, for efficient industrial production of such autoprocessing 
polypeptides. Thus, the autoprocessing polypeptides and the polynucleotides 
encoding such polypeptides of the present invention significantly expand and enhance 
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the available repertoire of available autoprocessing polypeptides having utility in 
numerous commercially important protein engineering applications, such as protein 
purification, affinity selection of display phages and post-translational protein 
ligation. 



EXAMPLE 2 

* 

Supplementary evidence demonstrating C-terminal in-vitro and in-vivo 
autocleavage by chimeric protein including the Type B BIL domain BIL2_rhosp 
Materials and Methods: 
10 In order to analyze the capacity of Type B BIL domain BIL2_rhosp (Table 1, 

Figure lb) to display autoprocessing activity, genetic sequences encoding this BIL 
domain including one flanking amino acid residue at each terminus were cloned for 
expression as a chimeric protein tagged at its amino terminal end with the malE gene- 
encoded maltose-binding protein (MBP) affinity tag, and at the carboxy terminal end 
15 with the B. circulans cbd gene-encoded chitin-binding domain (CBD) affinity tag. 
The resultant "MBP-RspBIL2a-CBD" chimera was expressed in-vivo and in-vitro and 
resulting protein products were analyzed for evidence of BIL domain-mediated 
autoprocessing activity according to methods described in Example 1 above. 
Experimental Results: 

20 The in-vivo expressed MBP-RspBIL2a-CBD chimera was shown to display C- 

terminal auto-cleavage activity via amylose-based affinity purification, electrophoretic 
separation, and Coomassie blue staining of the electrophoretically separated proteins 
(Figure 10a). The in-vivo expressed MBP-RspBIL2a-CBD chimera was also shown 
to display C-terminal cleavage via Western immunoblotting analysis of SDS-PAGE 

25 separated proteins (Figure 10b). The in-vitro expressed MBP-RspBIL2a-CBD 
chimera was shown to display C-terminal cleavage following [ 35 S]-methionine- 
labeling and autoradiography of electrophoretically separated protein (Figure 10c). 
Evidence from intact mass mass-spectrometry of the MBP-BIL specific carboxy 
terminal cleavage product indicates that the C-terminal end of this product is located 

30 6-11 amino acid residues from the predicted carboxy terminal end of the BIL domain 
towards the N-terminus. Evidence for the MBP-RspBIL2a identity of the cleavage 
product was obtained from Western Blot, and affinity column analysis. The presence 
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of the protein chaperone DnaK was detected during affinity purification of the BIL 
products. This chaperone may bind the BIL domain, and may also be involved in its 
activity. 

Conclusion: There is ample evidence that a chimeric protein which comprises 
5 the type B BIL domain BIL2_rhosp has the capacity to efficiently display carboxy 
terminal auto-cleavage activity. Hence, such a BIL domain can therefore be 
advantageously exploited in applications benefiting from an auto-cleaving chimeric 
protein. 



10 EXAMPLE 3 

N-terminal autocleavage of in-vivo expressed chimeric polypeptide comprising the 

Type B BIL domain 4825_rhosp 
Materials and Methods: 

In order to analyze the capacity of Type B BIL domain 4825_rhosp (Table 1, 
15 Figure lb) to display autoprocessing activity, genetic sequences encoding this BIL 
domain with 14 amino terminal-flanking and 51 carboxy terminal-flanking amino 
acids were cloned for expression as a chimeric protein tagged at its amino terminal 
end with the malE gene-encoded maltose-binding protein (MBP) affinity tag, and at 
the carboxy terminal end with the B. circulans cbd gene-encoded chitin-binding 
20 domain (CBD) affinity tag. The resultant "MBP-4825rhosp-CBD" chimera was 
expressed in-vivo and resulting protein products were analyzed for evidence of BIL 
domain-mediated autoprocessing activity according to methods described in Example 
1 above. 

Experimental Results: 

25 The in-vivo expressed MBP-4825rhosp-CBD chimera was shown to display N- 

terminal auto-cleavage activity via chitin or amylose based affinity purification of 
expressed protein and electrophoretic separation, and Coomassie blue staining of 
electrophoretically separated protein (Figure 11a). The chimera was also shown to 
display N-terminal cleavage via Western immunoblotting analysis ' of SDS-PAGE 

30 separated proteins using anti-CBD and anti-MBP antibodies (Figure l ib). The 
cleavage site was found to be located exactly between the predicted N-termifial 
residue of the BIL domain and the amino terminal flanking sequence as demonstrated 
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by amino terminal end sequencing. Large amounts of the chaperone GroEL were 
detected during the purification of the protein products when overexpressing the 
MBP-4825rhosp-CBD chimera the whereas no GroEL was detected when 
overexpressing control chimera lacking the BIL domain. 

Conclusion: A chimeric protein which comprises the type B BIL domain 
4825_rhosp ; has the capacity to efficiently display N-terminal auto-cleavage activity. 
Hence, such a BIL domain can therefore be advantageously exploited in applications 
benefiting from an auto-cleaving chimeric protein. 



10 EXAMPLE 4 

Auto-splicing and C-terminal auto-cleavage of in-vivo expressed chimeric protein 

including the Type A BIL domain BIL4_cloth 
Materials and Methods: 

In order to analyze the capacity of Type A BIL domain BEL4_cloth (Table 1, 
15 Figure la) to display autoprocessing activity, genetic sequences encoding this BIL 
domain were cloned, including one flanking amino acid residue of the host protein at 
the C-terminal end, for expression as a chimeric protein tagged at its amino terminal 
end with the malE gene-encoded maltose-binding protein (MBP) affinity tag, and at 
the carboxy terminal end with the B. circulans cbd gene-encoded chitin-binding 
20 domain (CBD) affinity tag. The resultant 'TVlBP-BIL4cloth-CBD" chimera was 
expressed in-vivo and resulting protein products were analyzed for evidence of BIL 
domain-mediated autoprocessing activity according to methods described in Example 
1 above. 

Experim ental Results: 

25 The in-vivo expressed MBP-BIL4cloth-CBD chimera was shown to display 

auto-splicing activity (MBP-CBD specific fragment) as shown by Western 
immunoblotting assay using anti CBD and anti MBP antibody probes (Figures 12a 
and 12b, respectively), and as shown by Coomassie blue staining of 

* t i 

electrophoretically separated protein products isolated via amylose based or chitin 
30 based affinity chromatography depicting auto-splicing activity. The identity of the 
auto-splicing product was verified via mass-spectrometry peptide mapping (data not 
shown). Carboxy terminal auto-cleavage activity (MBP-BIL domain specific 
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fragment) was also detected by Western immunoblotting assay using an anti MBP 
antibody probe (Figure 12b) and by Coomassie blue staining of electrophoretically 
separated protein products isolated via amylose based affinity chromatography 
(Figure 12c). 



as the type A BIL domain BIL4_cloth, has the capacity to efficiently display auto- 
splicing and carboxy terminal auto-cleaving activity. Hence, such a BIL domain can 
therefore be advantageously exploited in applications benefiting from an auto-splicing 
and/or carboxy terminal auto-cleaving chimeric protein. 



It is appreciated that certain features of the invention, which are, for clarity, 
described in the context of separate embodiments, may also be provided in 
combination in a single embodiment. Conversely, various features of the invention, 
which are, for brevity, described in the context of a single embodiment, may also be 
15 provided separately or in any suitable subcombination. 

Although the invention has been described in conjunction with specific 
embodiments thereof, it is evident that many alternatives, modifications and variations 
will be apparent to those skilled in the art. Accordingly, it is intended to embrace all 
20 such alternatives, modifications and variations that fall within the spirit and broad 
scope of the appended claims. All publications, patents, patent applications and 
sequences identified by their accession numbers mentioned in this specification are 
herein incorporated in their entirety by reference into the specification, to the same 

■ 

extent as if each individual publication, patent, patent application or sequence 
25 identified by their accession number was specifically and individually indicated to be 
incorporated herein by reference. In addition, citation or identification of any 
reference in this application shall not be construed as an admission that such reference 
is available as prior art to the present invention. 



5 



Conclusion: A chimeric protein which comprises A type a BIL domain, such 
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1. A chimeric polypeptide comprising an autoprocessing segment having 
an amino acid sequence set forth by SEQ ID NO: 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 
18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 
41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 
64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 
87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105 or 106. 

2. The chimeric polypeptide of claim 1, wherein said autoprocessing 
segment is selected from the group consisting of BILl_cloth, BIL2_cloth, 
BIL3_cloth, BIL4_cloth, BIL5_cloth, BIL6_cloth, BIL7_cloth, BIL9_cloth, 
BIL 1 0_cloth, BIL 1 l_cloth, 3 875_87_magma, FhaB_manha, BIL2_neigo, 
BIL3_neigo, BEL5 jneigo, BIL6 neigo, MafBl neigo, MafB2_neigo, 
B0369+_neimeB, B0372+_neimeB, B0655+_neimeB, A2 1 1 5_neime, BIL2_neimeC, 
BIL3_neimeC, BIL4_neimeC, BDL5_neimeC, BIL6_neimeC, MafBl_neimeC, 

■ 

FhaBl_psefl-PfO-l, FhaBljpsefl-SBW25, FhaBjpsesy, SCP1.201_strco, 
39_9_thefus, BILl_gemob, BIL2_gemob, 0709_lepin, 3725_lepin, 3719_lepin, 
o665_myxxa, ol078_myxxa, ol070_myxxa, BILl_strav, BIL2_strav, BIL3_strav, 
BILljpirsp, BDLl_chrvi, BILl_glovi, BIL2_glovi, BIL3_glovi, BBL4_glovi, 
BIL5_glovi, BDL6_glovi, BIL7_glovi, o649_versp, o5687_versp, o3395_versp, 
II0519_brume, BBL2_magma, BEL3_magma, BIL4_magma, BIL5_magma, 
BIL6_magma, 06786_metex, 00126_rhoca, 00199_rhoca, 00459_rhoca, 00460_rhoca, 
00746_rhoca, 00949_rhoca, O1216_rhoca, 01374_rhoca, O1523_rhoca, 01524_rhoca, 
02710_rhoca, 03530_rhoca, 4825_rhosp, BIL2_rhosp, BILl_silpo, BEL2_silpo, 
BIL3_silpo, BIL4_silpo, BIL5_silpo, BIL6_silpo, BIL7_silpo 3 BIL8_silpo, 
BIL9_silpo, BIL10_silpo, BILll_silpo, BIL12_silpo, BIL13_silpo, BIL14_silpo, 
BIL 1 5_silpo, BIL 1 6_silpo, Bil l_rhile, BIL l_unknwn, and BIL2_unfcnwn. 

3. The chimeric polypeptide of claim 1, wherein said autoprocessing 
segment is derived from a protein of an organism belonging to a genus selected from 
the group consisting of Brucella, Clostridium, Magnetospirillum, Mannheimia, 



WO 2004/043997 





YIL2003/000956 



59 



Methylobacterium, Neisseria, Pseudomonas, Rhodobacter, Silicibacter, Streptomyces, 
Thermobifida, Rhizobium, Chromobacterium, Myxococcus, Leptospira, Pirellula, 
Gemmata, Gloeobacter and Verrucomicrobium. 

4. The chimeric polypeptide of claim 3, wherein said organism is selected 
from the group consisting of Rhodobacter capsulatus, Rhodobacter sphaeroides, 
Silicibacter pomeroyi, Brucella melitensis, Brucella suis, Magnetospirillum 
magnetotacticum, Methylobacterium extorquens, Rhizobium leguminosarum, 
Neisseria meningitidis, Neisseria meningitidis, Neisseria meningitidis, Neisseria 
gonorrhoeae, Chromobacterium violaceum, Pseudomonas syringae, Pseudomonas 
fluorescens, Pseudomonas fluorescens, Mannheimia haemolytica, Myxococcus 
xanthus, Leptospira interrogans, Streptomyces coelicolor, Streptomyces avermitilis, 
Thermobifida fusca, Clostridium thermocellum, Pirellula species 1, Gemmata 
obscuriglobus, Gloeobacter violaceus, and Verrucomicrobium spinosum. 

5. The chimeric polypeptide of claim 1, wherein said auto-cleavage 
results in removal of a segment of the polypeptide adjacent to an amino terminal end 
or a carboxy terminal end of said autoprocessing segment. 

6. The chimeric polypeptide of claim 5, wherein said segment of the 
polypeptide adjacent to said autoprocessing segment is an amino terminal segment or 
a carboxy terminal segment of the polypeptide. 



polypeptide adjacent to said carboxy terminal end of said autoprocessing segment 
includes an amino acid residue comprising a nucleophilic group at an amino terminal 
end thereof 



7. 



The chimeric polypeptide of claim 5, wherein said segment of the 



8. 



The chimeric polypeptide of claim 7, wherein said nucleophilic group 



is a hydroxy! group. 



9. 



The chimeric polypeptide of claim 7, wherein said amino acid residue 
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is a threonine residue. 



10. The chimeric polypeptide of claim 5, wherein said segment of the 
polypeptide adjacent to said amino terminal end of said autoprocessing segment 
includes a serine amino acid residue at a carboxy terminal end thereof. 

11. The chimeric polypeptide of claim 1, wherein said auto-cleavage 
results in auto-splicing. 

12. The chimeric polypeptide of claim 11, wherein said auto-splicing is 
auto-splicing of segments of the polypeptide flanking said autoprocessing segment. 

13. The chimeric polypeptide of claim 1, wherein the chimeric polypeptide 
is capable of said auto-cleavage under a condition selected from the group consisting 
of a temperature selected from a range of 33 °C to 41 °C, a pH selected from a range 
of pH 7.8 to pH 8.2, and a concentration of dithiothreitol selected from a range of 0.1 
mM to 20 mM. 

14. The chimeric polypeptide of claim 1, further comprising an affinity tag 
capable of specifically binding a substrate. 

15. The chimeric polypeptide of claim 14, wherein said affinity tag. is a 
maltose-binding domain or a chitin-binding domain. 

16. The chimeric polypeptide of claim 14, wherein said substrate is 
selected from the group consisting of a molecule, a compound, a virus, and a cell. 

17. The chimeric polypeptide of claim 16, wherein said molecule is 

» 

amylose or chitin. 

18. The chimeric polypeptide of claim 16, wherein said virus is a 
bacteriophage. 
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19. A polynucleotide encoding a chimeric polypeptide comprising an 
autoprocessing segment having an amino acid sequence set forth by SEQ ID NO: 8, 9, 
10, 1 1, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 
33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 
56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 
79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 
101, 102, 103, 104, 105 or 106, the polypeptide being capable of auto-cleavage. 

20. The polynucleotide of claim 19, wherein said autoprocessing segment 
is selected from the group consisting of BILl cloth, BIL2_cloth, BIL3_cloth, 
BIL4_cloth, BIL5_cloth, BIL6__cloth, BIL7_cloth, BIL9_cloth, BIL10_cloth, 
BIL1 l_cloth, 3875_87_magma, FhaB_manha, BIL2_neigo, BIL3_neigo, BIL5_neigo, 
BIL6_neigo, MafBl_neigo, MafB2_neigo, B0369+_neimeB, B0372+_neimeB, 
B0655+_neimeB, A2115_neime, BIL2_neimeC, BIL3_neimeC, BIL4_neimeC, 
BIL5_neimeC, BIL6_neimeC, MafB I neimeC, FhaBl_psefl-PfO-l, FhaBl _psefl- 
SBW25, FhaB_psesy, SCP1.201_strco, 39_9_thefus, BILl_gemob, BIL2 _gemob, 
0709_lepin, 3725_lepin, 3719_lepin, o665_myxxa, ol078_myxxa, ol070_myxxa, 
BILl_strav, BIL2_strav, BIL3_strav, BILl_pirsp, BILl_chrvi, BILl_glovi, 
BIL2_glovi, BIL3_glovi, BIL4_glovi, BIL5_glovi, BIL6_glovi, BIL7_glovi, 
o649_versp, o5687_versp, o3395_versp, n0519_brume, BIL2_magma, BIL3_magma, 
BIL4_magma, BIL5_magma, BIL6_magma, 06786_metex, 00126_rhoca, 
O0199_rhoca, 00459_rhoca, 00460_rhoca, 00746_rhoca, 00949_rhoca, 01216_rhoca, 
0l374_rhoca, 01523_rhoca, 01524_rhoca, 02710_rhoca, 03530_rhoca, 4825_rhosp, 
BIL2_rhosp, BILl_silpo, BIL2_silpo, BIL3_silpo, BIL4_silpo, BIL5_silpo, 
BIL6_siIpo, BIL7_silpo, BIL8_silpo, BIL9_silpo, BIL10_silpo, BILll_silpo, 
BIL12__silpo, BIL13_silpo, BIL14_silpo, BIL15_silpo, BIL16_silpo, Bill_rhile, 
BILl_unknwn, and BIL2_unknwn. 



21. The polynucleotide of claim 19, wherein said autoprocessing segment 
is derived from a protein of an organism belonging to a genus selected from the group 
consisting of Brucella, Clostridium, Magnetospirillum, Mannheimia, 
Methylobacterium, Neisseria, Pseudomonas, Rhodobacter, Silicibacter, Streptomyces, 
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Thermobifida, Rhizobium, Chromobacterium, Myxococcus, Leptospira, Pirellula, 
Gemmata, Gloeobacter and Vetrucomicrobium. 



22. The polynucleotide of claim 21, wherein said organism is selected from 
the group consisting of Rhodobacter capsulatus, Rhodobacter sphaeroides, 
Silicibacter pomeroyi, Brucella melitensis, Brucella suis, Magnetospirillum 
magnetotacticum, Methylobacterium extorquens, Rhizobium leguminosarum, 
Neisseria meningitidis, Neisseria meningitidis, Neisseria meningitidis, Neisseria 
gonorrhoeae, Chromobacterium violaceum, Pseudomonas syringae, Pseudomonas 
fluorescens, Pseudomonas Jluorescens, Mannheimia haemolytica, Myxococcus 
xanthus, Leptospira interrogans, Streptomyces coelicolor, Streptomyces avermitilis, 
Thermobifida fusca, Clostridium thertnocellum, Pirellula species 1, Gemmata 
obscuriglobus, Gloeobacter violaceus, and Verrucomicrobium spinosum. 



23. The polynucleotide of claim 19, wherein said auto-cleavage results in 
removal of a segment of the chimeric polypeptide adjacent to an amino terminal end 
or a carboxy terminal end of said autoprocessing segment. 

24. The polynucleotide of claim 23, wherein said segment of the chimeric 
polypeptide adjacent to said autoprocessing segment is an amino terminal segment or 
a carboxy terminal segment of the chimeric polypeptide. 

25. The polynucleotide of claim 23, wherein said segment of the chimeric 
polypeptide adjacent to said carboxy terminal end of said autoprocessing segment 
includes an amino acid residue comprising a nucleophilic group at an amino terminal 
end thereof. 

26. The polynucleotide of claim 25, wherein said nucleophilic group is a 
hydroxyl group. 

27. The polynucleotide of claim 25, wherein said amino acid residue is a 
threonine residue. 
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28. The polynucleotide of claim 23, wherein said segment of the chimeric 
polypeptide adjacent to said amino terminal end of said autoprocessing segment 
includes a serine amino acid residue at a carboxy terminal end thereof. 

29. The polynucleotide of claim 19, wherein said auto-cleavage results in 
auto-splicing. 

30. The polynucleotide of claim 29, wherein said auto-splicing is auto- 
splicing of segments of the chimeric polypeptide flanking said autoprocessing 
segment. 

31. The polynucleotide of claim 19, wherein the chimeric polypeptide is 
capable of said auto-cleavage under a condition selected from the group consisting of 
a temperature selected from a range of 33 °C to 41 °C, "a pH selected from a range of 
pH 7.8 to pH 8.2, and a concentration of dithiothreitol selected from a range of 0.1 
mM to 20 mM. 

32. The polynucleotide of claim 19, wherein the chimeric polypeptide 
further comprises an affinity tag capable of specifically binding a substrate. 

33. The polynucleotide of claim 32, wherein said affinity tag is a maltose- 
binding domain or a chitin-binding domain. 

34. The polynucleotide of claim 32, wherein said substrate is selected from 
the group consisting of a molecule, a compound, a virus, and a cell. 

■ 

35. The polynucleotide of claim 33, wherein said molecule is amylose or 

chitin. 

i 

36. The polynucleotide of claim 33, wherein said virus is a bacteriophage. 

37. The polynucleotide of claim 19, further comprising a promoter 
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sequence being for directing expression of the chimeric polypeptide in an expression 
system. 

38. The polynucleotide of claim 37, wherein said expression system is a 
cellular expression system or a cell-free expression system. 

* 

39. The polynucleotide of claim 38, wherein said cellular expression 
system is an E. coli cellular expression system. 

40. The polynucleotide of claim 38, wherein said cell-free expression 
system is an E. coli S30 extract expression system. 

41. The polynucleotide of claim 37, wherein said promoter sequence is 
inducible by isopropyl beta-D-thiogalactoside. 

42. A nucleic acid construct comprising a nucleic acid sequence encoding a 
chimeric polypeptide comprising an autoprocessing segment having an amino acid 
sequence set forth by SEQ ID NO: 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 
22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 
45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 
68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 
91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105 or 106, said chimeric 
polypeptide being capable of auto-cleavage. 

43. The nucleic acid construct of claim 42, further comprising a promoter 
sequence being for directing expression of said chimeric polypeptide in an expression 
system. 

m * 

44. The nucleic acid construct of claim 43, wherein said expression system 
is a cellular expression system or a cell-free expression system. 



45. The nucleic acid construct of claim 44, wherein said cellular expression 
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system is an E. coli cellular expression system. 

46. The nucleic acid construct of claim 44, wherein said cell-free 
expression system is an E. coli S30 extract expression system. 

47. . The nucleic acid construct of claim 43, wherein said promoter sequence 
is inducible by isopropyl beta-D-thiogalactoside. 



48. The nucleic acid construct of claim 42, wherein said autoprocessing 
segment is selected from the group consisting of BILl_cloth, BIL2_cloth, 
BIL3_cloth, BIL4_cloth, BIL5_cloth, BIL6_cloth, BEL7_cloth, BIL9_cloth, 
BIL10_cloth, BILll_cloth, 3875_87_magma, FhaB_manha, BIL2_neigo, 
BIL3_neigo, BBL5_neigo, BIL6_neigo, MafBlneigo, MafB2_neigo, 
B0369+_neimeB, B0372+_neimeB, B0655+_neimeB, A2115_neime, BIL2_neimeC, 
BIL3_neimeC, BIL4_neimeC, BIL5_neimeC, BIL6_neimeC, MafB l_neimeC, 
FhaBl_psefl-PfO-l, FhaBl_psefl-SBW25, FhaB_psesy, SCP1.201_strco, 
39_9_thefus, BILl_gemob, BIL2_gemob, 0709_lepin, 3725_lepin, 3719_lepin, 
o665_myxxa, ol078_myxxa, ol070_myxxa, BILl_strav, BIL2_strav, BIL3_strav, 
BILl_pirsp, BILl_chrvi, BIL1 _glovi, BIL2_glovi, BIL3_glovi, BIL4_glovi, 
BIL5_glovi, BIL6_glovi, BIL7 _glovi, o649_versp, o5687_versp, o3395_versp, 
II0519_brume, BIL2_magma > BIL3_magma, BIL4_magrna, BIL5_magma, 
BIL6_magma, 06786_metex, 00126_rhoca, 00199_rhoca, 00459_rhoca, 00460_rhoca, 
00746_rhoca, 00949_rhoca, 01216_rhoca, 01374_rhoca, 01523_rhoca, 01524_rhoca, 
02710_rhoca, 03530_rhoca, 4825_rhosp, BIL2_rhosp, BILl_silpo, BDL2_silpo, 
BIL3_silpo, BIL4_silpo, BIL5_silpo, BIL6_silpo, BIL7_silpo, BIL8_silpo, 
BIL9_silpo, BIL10_silpo, BILll_silpo, BIL12_silpo, BBL13_silpo, BIL14_silpo, 
BILlSsilpo, BIL16_silpo, Bill rhile, BILl_unknwn, and BIL2_unknwn. 

49. The nucleic acid construct of claim 42, wherein said autoprocessing 
segment is derived from a protein of an organism belonging to a genus selected from 
the group consisting of Brucella, Clostridium, Magnetospirillum, Mannheimia, 
Methylobacterium, Neisseria, Pseudomonas, Rhodobacter. Silicibacter, Streptomyces, 
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Tliermobifida, Rhizobium, Chromobacterium, Myxococcus, Leptospira, Pirellula, 
Gemmata, Gloeobacter and Verrucomicrobium. 

50. The nucleic acid construct of claim 49, wherein said organism is 
selected from the group consisting of Rhodobacter capsulatus, Rhodobacter 
sphaeroides; Silicibacter pomeroyi, Brucella melitensis, Brucella suis, 
Magnetospirillum magnetotacticum, Methylobacterium extorquens, Rhizobium 
leguminosarum, Neisseria meningitidis, Neisseria meningitidis, Neisseria 
meningitidis, Neisseria gonorrhoeae, Chromobacterium violaceum, Pseudomonas 
syringae, Pseudomonas fluorescens, Pseudomonas fluorescens, Mannheimia 
haemolytica, Myxococcus xanthus, Leptospira interrogans, Streptomyces coelicolor, 
Streptomyces avermitilis, Thermobifida fusca, Clostridium thermocellum, Pirellula 
species 1, Gemmata obscuriglobus, Gloeobacter violaceus, and Verrucomicrobium 



51. The nucleic acid construct of claim 42, wherein said auto-cleavage 
results in removal of a segment of said chimeric polypeptide adjacent to an amino 
terminal end or a carboxy terminal end of said autoprocessing segment. 

52. The nucleic acid construct of claim 51, wherein said segment of said 
chimeric polypeptide adjacent to said autoprocessing segment is an amino terminal 
segment or a carboxy terminal segment of said chimeric polypeptide. 

■ 

53. The nucleic acid construct of claim 51, wherein said segment of said 
chimeric polypeptide adjacent to said carboxy terminal end of said autoprocessing 
segment includes an amino acid residue comprising a nucleophilic group at an amino 

• * 

terminal end thereof. 

54. The nucleic acid construct of claim 53, wherein said nucleophilic group 
is a hydroxyl group. 

55. The nucleic acid construct of claim 53, wherein said amino acid residue 



spinosum. 
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is a threonine residue. 

56. The nucleic acid construct of claim 51, wherein said segment of said 
chimeric polypeptide adjacent to said amino terminal end of said autoprocessing 
segment includes a serine amino acid residue at a carboxy terminal end thereof. 

57. The nucleic acid construct of claim 42, wherein said auto-cleavage 
results in auto-splicing. 

58. The nucleic acid construct of claim 57, wherein said auto-splicing is 
auto-splicing of segments of said chimeric polypeptide flanking said autoprocessing 
segment. 

59. The nucleic acid construct of claim 42, wherein said chimeric 
polypeptide is capable of said auto-cleavage under a condition selected from the group 
consisting of a temperature selected from a range of 33 °C to 41 °C, a pH selected 
from a range of pH 7.8 to pH 8.2, and a concentration of dithiothreitol selected from a 
range of 0. 1 mM to 20 mM. 

60. The nucleic acid construct of claim 42, wherein said chimeric 
polypeptide further comprises an affinity tag capable of specifically binding a 
substrate. 

6 1 . Hie nucleic acid construct of claim 60, wherein said affinity tag is a 
maltose-binding domain or a chitin-binding domain. 

■ » 

62. A method of generating a chimeric polypeptide capable of displaying 
auto-cleavage, the method comprising generating a chimeric amino acid sequence 
including an autoprocessing segment, said autoprocessing segment having an amino 
acid sequence set forth by SEQ ID NO: 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 

■ 

21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 
44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 



WO 2004/043997 




T7IL2003/000956 



68 

67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 
90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105 or 106, thereby 
producing the chimeric polypeptide capable of displaying auto-cleavage. 

63. The method of claim 62, wherein said generating said chimeric amino 
acid sequence is effected by synthesizing a polynucleotide encoding said chimeric 
amino acid sequence and expressing said polynucleotide in an expression system. 

64. The method of claim 63, wherein said expression system is a cellular 
expression system or a cell-free expression system. 

65. The method of claim 64, wherein said cellular expression system is an 
E. coli cellular expression system. 

66. The method of claim 64, wherein said cell-free expression system is an 
E. coli S30 extract expression system. 

67. The method of claim 63, wherein said polynucleotide comprises a 
promoter sequence being for directing said expression of the chimeric polypeptide. 

68. The method of claim 67, wherein said promoter sequence is inducible 
by isopropyl beta-D-thiogalactoside. 

69. The method of claim 62, wherein said autoprocessing segment is 
selected from the group consisting of BILl_cloth, BIL2_cloth, BIL3_cloth, 
BIL4_cloth, BIL5_cloth, BIL6_cloth, BIL7_cloth, BIL9_cloth, BIL10_cloth, 
BIL1 l_cloth, 3875_87_magma, FhaBjmanha, BIL2_peigo, BIL3_neigo, BIL5_jieigo, 
BIL6_neigo, MafB l_neigo, MafB2_neigo, B0369+jneimeB, B0372+_neimeB, 
B06554-_neimeB, A2 1 1 5_neime, BIL2_neimeC, BIL3_neimeC, BIL4_neimeC, 
BIL5_neimeC, BIL6_neimeC, MafBl_neimeC, FhaBljpsefl-PfO-1, FhaBl _j)sefl- 
SBW25, FhaB_psesy, SCP1.201_strco, 3?_9__thefus, BILl^gemob, BlL2_gemob, 
0709Jepin, 3725_iepin, 3719_lepin, o665jmyxxa, ol078_myxxa, ol070_myxxa, 
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BILlstrav, BIL2_strav, BIL3_strav, BILljpirsp, BILlchrvi, BILl_glovi, 
BIL2_glovi, BDL3_glovi, BIL4__glovi, BIL5_glovi, BIL6_glovi, BEL7_glovi, 
o649_versp, o5687_versp, o3395_versp, H05 1 9_brume, BIL2_magma, BIL3_magma, 
BIL4_magma, BIL5_magma, BIL6_magma, 06786_metex, 00126_rhoca, 
00199_rhoca, 00459_rhoca, 00460_rhoca, 00746_rhoca, 00949_rhoca, 01216_rhoca, 
01374_rhoca, 01523_rhoca, 01524_rhoca, 02710_rhoca, 03530_rhoca, 4825_rhosp, 
BIL2_rhosp, BILl_silpo, BIL2_silpo, BIL3_silpo, BDL4_silpo, BIL5_siIpo, 
BDL6_silpo, BIL7_silpo, BIL8_silpo, BIL9_silpo, BBL10_silpo, BILll_silpo, 
BIL12_silpo, BDL13_silpo, BEL14_silpo, BIL15_silpo, BIL16_silpo, Bill rhile, 
BILl_unknwn, and BIL2 unknwn. 

70. The method of claim 62, wherein said autoprocessing segment is 
derived from a protein of an organism belonging to a genus selected from the group 
consisting of Brucella, Clostridium, Magnetospirillum, Mannheimia, 
Methylobacterium, Neisseria, Pseudomonas, Rhodobacter, Silicibacter, Streptomyces, 
Thermobifida, Rhizobium, Chromobacterium, Myxococcus, Leptospira, Pirellula, 
Gemmata, Gloeobacter and Verrucomicrobium. 

71. The method of claim 70, wherein said organism is selected from the 
group consisting of Rhodobacter capsulatus, Rhodobacter sphaeroides, Silicibacter 
pomeroyi, Brucella rnelitensis, Brucella suis, Magnetospirillum magnetotacticum, 
Methylobacterium extorquens, Rhizobium leguminosarum, Neisseria meningitidis, 
Neisseria meningitidis, Neisseria meningitidis, Neisseria gonorrhoeae, 
Chromobacterium violaceum, Pseudomonas syringae, Pseudomonas fluorescens, 
Pseudomonas fluorescens, Mannheimia haemolytica, Myxococcus xanthus, 
Leptospira interrogans, Streptomyces coelicolor, Streptomyces avertnitilis, 
Thermobifida fiisca, Clostridium thermocellum, Pirellula species 1, Gemmata 
obscuriglobus, Gloeobacter violaceus, and Verrucomicrobium spinosum. 

72. The method of claim 62, wherein said auto-cleavage results in removal 
of a segment of the chimeric polypeptide adjacent to an amino terminal end or a 
carboxy terminal end of said autoprocessing segment. 
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73. The method of claim 72, wherein said segment of the chimeric 
polypeptide adjacent to said autoprocessing segment is an amino terminal segment or 
a carboxy terminal segment of the chimeric polypeptide. 



74. The method of claim 72, wherein said segment of the chimeric 
polypeptide, adjacent to said carboxy terminal end of said autoprocessing segment 
includes an amino acid residue comprising a nucleophilic group at an amino terminal 
end thereof. 



75. The method of claim 74, wherein said nucleophilic group is a hydroxy 1 

group. 



76. The method of claim 74, wherein said amino acid residue is a threonine 
residue. 

77. The method of claim 72, wherein said segment of the chimeric 
polypeptide adjacent to said amino terminal end of said autoprocessing segment 
includes a serine amino acid residue at a carboxy terminal end thereof. 

78. The method of claim 62, wherein said auto-cleavage results in auto- 
splicing. 

79. The method of claim 78, wherein said auto-splicing is auto-splicing of 
segments of the chimeric polypeptide flanking said autoprocessing segment. 

80. The method of claim 62, wherein the chimeric polypeptide is capable 

■ 

of said auto-cleavage under a condition selected from the group consisting of a 
temperature selected from a range of 33 °C to 41 °C, a pH selected from a range of pH 
7.8 to pH 8.2, and a concentration of dithiothreitol selected from a range of 0.1 mM to 
20 mM. 

* 

81 . The method of claim 62, wherein the chimeric polypeptide includes an 
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affinity tag. 

82. The method of claim 81, wherein said affinity tag is a maltose-binding 
domain or a chitin-binding domain. 

83. . A method of purifying a protein, the method comprising: 

(a) generating a chimeric polypeptide including an autoprocessing 
segment having an amino acid sequence set forth by SEQ ID NO: 8, 9, 
10, 1 1, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 
29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 
48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 
67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 
86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 
103, 104, 105 or 106, said autoprocessing segment being terminally 
attached to, or flanked by, an amino acid sequence of the protein, said 
chimeric polypeptide being capable of auto-cleavage when subjected to 
suitable conditions to thereby remove said amino acid sequence of the 
protein from said chimeric polypeptide thereby generating the protein; 

(b) immobilizing said chimeric polypeptide to a support; and 

(c) subjecting said chimeric polypeptide to said suitable conditions, 
thereby purifying the protein. 

84. The method of claim 83, further comprising the step of separating the 
protein from said autoprocessing segment following step (c). 

85. The method of claim 83, wherein said support includes an antibody or 
antibody fragment capable of specifically binding said autoprocessing segment, and 
whereas said immobilizing is via said autoprocessing segment. 

86. The method of claim 83, wherein said chimeric polypeptide further 
includes an affinity tag sequence, and whereas said immobilizing is via said affinity 
tag sequence. 
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87. The method of claim 86, wherein said support includes a specific 
ligand of said affinity tag sequence, and whereas said immobilizing is via said specific 
ligand of said affinity tag sequence. 

88. The method of claim 83, wherein said generating said chimeric 
polypeptide, is effected by synthesizing a polynucleotide encoding said chimeric 
polypeptide and expressing said polynucleotide in an expression system. 

89. The method of claim 88, wherein said expression system is a cellular 
expression system or a cell-free expression system. 

90. The method of claim 89, wherein said cellular expression system is an 
E. coli cellular expression system. 

91. The method of claim 89, wherein said cell-free expression system is an 
E. coli S30 extract expression system. 

92. The method of claim 88, wherein said polynucleotide comprises a 
promoter sequence being for directing said expression of said chimeric polypeptide. 

93. The method of claim 92, wherein said promoter sequence is inducible 
by isopropyl beta-D-thiogalactoside. 

94. The method of claim 83, wherein said autoprocessing segment is 
selected from the group consisting of BILl_cloth, BIL2_cloth, BIL3 cloth, 
BIL4_cloth, BIL5_cloth, BIL6_cloth, BBL7_cloth, BIL9_cloth, BIL10_cloth, 
BILll_cloth, 3875_87_magma, FhaB_manha, BIL2_neigo, BIL3_neigo, BIL5_neigo, 
BDL6_neigo, MafB l_neigo, MafB2_jieigo, B0369+_neimeB, B0372+_neimeB, 
B0655+_neimeB, A2 1 1 5_neime, BIL2_neimeC, BIL3_neimeC, BIL4_neimeC, 
BIL5_neimeC, BiIL6_neimeC, Ma£Bl_neimeC, FhaBl_psefl-PfO-l, FhaB l_psefl- 
SBW25, FhaB_psesy, SCP1.201_strco, 39_9_thefus, BH.l_gemob, BIL2_gemob, 
O709_lepin, 3725_lepin, 3719_lepin, o665_myxxa, ol078_myxxa, o 1 070_myxxa, 
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BILl_strav, BIL2_strav, BIL3_strav, BILl_pirsp, BILl_chrvi, BILlglovi, 
BIL2_glovi, BEL3_glovi, BIL4_glovi, BIL5_glovi, BIL6_glovi, BDL7_glovi, 
o649_versp, o5687_versp, o3395_versp, 1105 1 9_brume, BIL2_magma, BIL3_magma, 
BIL4_magma, BIL5_magma, BIL6_magma, 06786_metex, 00126_rhoca, 
00199_rhoca, 00459_rhoca, 00460__rhoca, 00746_rhoca, 00949_rhoca, 01216_rhoca, 
01374_rhoca, 01523_rhoca, 01524_rhoca, 02710_rhoca, 03530_rhoca, 4825_rhosp/ 
BIL2_rhosp, BIL ljsilpo, BIL2_silpo, BIL3_silpo, BEL4_silpo, BIL5_silpo, 
BIL6_silpo, BIL7__silpo, BIL8_silpo, BIL9_silpo, BIL10_silpo, BILll_silpo, 
BIL12_silpo 5 BIL13_silpo, BIL14_silpo, BIL15_silpo, BIL16_silpo, Bill_rhile, 
BILl_unknwn, and BIL2_unknwn. 

95. The method of claim 83, wherein said autoprocessing segment is 
derived from a protein of an organism belonging to a genus selected from the group 
consisting of Brucella, Clostridium, Magnetospirillum, Mannheimia, 
Methylobacterium, Neisseria, Pseudomonas, Rhodobacter, Silicibacter, Streptomyces, 
Tliermobifida, Rhizobium, Chromobacterium, Myxococcus, Leptospira, Pirellula, 
Gemmata, Gloeobacter and Verrucomicrobium. 



96. The method of claim 95, wherein said organism is selected from the 
group consisting of Rhodobacter capsulatus, Rhodobacter sphaeroides, Silicibacter 
pomeroyi, Brucella melitensis, Brucella suis, Magnetospirillum magnetotacticum, 
Methylobacterium extorquens, Rhizobium leguminosarum, Neisseria meningitidis, 
Neisseria meningitidis, Neisseria meningitidis, Neisseria gonorrhoeae, 
Chromobacterium violaceum, Pseudomonas syringae, Pseudomonas fluorescens, 
Pseudomonas fluorescens, Mannheimia haemolytica, Myxococcus xanthus, 
Leptospira interrogans, Streptomyces coelicolor, Streptomyces avermitilis, 
Tliermobifida fusca, Clostridium thermocellum, Pirellula species 1, Gemmata 
obscuriglobus, Gloeobacter violaceus, and Verrucomicrobium spinosum. 

97. The method of claim 83, wherein said auto-cleavage results in removal 
of a segment of said chimeric polypeptide adjacent to an amino terminal end or a 
carboxy terminal end of said autoprocessing segment. 
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98. The method of claim 97, wherein said segment of said chimeric 
polypeptide adjacent to said autoprocessing segment is an amino terminal segment or 
a carboxy terminal segment of said chimeric polypeptide. 

99. The method of claim 97, wherein said segment of said chimeric 
polypeptide adjacent to said carboxy terminal end of said autoprocessing segment 
includes an amino acid residue comprising a nucleophilic group at an amino terminal 
end thereof. 



1 00. The method of claim 99, wherein said nucleophilic group is a hydroxyl 

group. 



101. The method of claim 99, wherein said amino acid residue is a threonine 

■ 

residue. 



102. The method of claim 97, wherein said segment of said chimeric 
polypeptide adjacent to said amino terminal end of said autoprocessing segment 
includes a serine amino acid residue at a carboxy terminal end thereof. 

103. The method of claim 83, wherein said auto-cleavage results in auto- 
splicing. 

1 04. The method of claim 103, wherein said auto-splicing is auto-splicing of 
segments of said chimeric polypeptide flanking said autoprocessing segment. 

105. The method of claim 83, wherein said chimeric polypeptide is capable 
of said auto-cleavage under a condition selected from the group consisting of a 
temperature selected from a range of 33 °C to 41 °C, a pH selected from a range of pH 
7.8 to pH 8.2, and a concentration of dithiothreitol selected from a range of 0.1 mM to 
20 mM. 



106. A method of reversibly attaching a first substrate to a second substrate, 
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the method comprising: 

(a) providing a chimeric polypeptide including an autoprocessing segment 
having an amino acid sequence set forth by SEQ ID NO: 8, 9, 10, 11, 
12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 
31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 

. 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 
69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 
88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 
105 or 106 flanked by a first amino acid sequence capable of binding 
the first substrate and a second amino acid sequence capable of binding 
the second substrate, said chimeric polypeptide being capable of auto- 
cleavage when subjected to suitable conditions, to thereby release said 
first amino acid sequence from said second amino acid sequence; 

(b) exposing the first substrate and the second substrate to said chimeric 
polypeptide, thereby generating a complex including the first substrate 
attached via said chimeric polypeptide to the second substrate; and 

(c) subjecting said complex to said suitable conditions, thereby detaching 
the first substrate from the second substrate. 

107. The method of claim 106, wherein each of the first and second 
substrates is independently selected from the group consisting of a molecule, a 
compound, a virus, and a cell. 

108. The method of claim 107, wherein said molecule is amylose orchitin. 

109. The method of claim 107, wherein said virus is a bacteriophage. 

110. The method of claim 106, wherein said chimeric polypeptide includes 
an affinity tag sequence, and whereas said binding the first substrate or said binding 
the second substrate is via said affinity tag sequence. 



111. The method of claim 110, wherein said affinity tag sequence is a 
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maltose-binding domain or a chitin-binding domain. 



112. The method of claim 106, wherein said autoprocessing segment is 
selected from the group consisting of BELl_cloth, BIL2_cloth, BIL3_cloth, 
BIL4_cloth, BIL5_cloth, BIL6_cloth, BIL7_cloth, BIL9_cloth, BDL10_cloth, 
BIL1 l_cloth, 3875_87_magma, FhaBjmanha, BIL2_neigo, BIL3_neigo, BIL5_neigo, 
BIL6_neigo, Ma£Bl_neigo, MafB2_neigo, B0369+_neimeB, B0372+_neimeB, 
B0655+_neimeB, A2115_neime, BIL2_neimeC, BIL3_neimeC, BDL4_neimeC, 
BIL5_neimeC, BIL6_neimeC, MafB l_neimeC, FhaB l_psefl-PfO- 1 , FhaBl^psefl- 
SBW25, FhaB_psesy, SCP1.201_strco, 39_9_thefus, BDwl_gemob, BEL2_gemob, 
0709_lepin, 3725_lepin, 3719_lepin, o665_myxxa, ol078_myxxa, ol070_myxxa, 
BILl_strav, BIL2_strav, BIL3_strav, BILl_pirsp, BILl_chrvi, BILl_glovi, 
BIL2_glovi, BIL3_glovi, BIL4_glovi, BIL5_glovi, BDL6_glovi, BEL7_glovi, 
o649_versp, o5687_versp, o3395_versp, II0519_brume, BEL2_magma, BIL3_magma, 
BIL4_magma, BIL5_magma, BIL6_magma, 06786_metex, 00126_rhoca, 
00199_rhoca, 00459_rhoca, 00460_rhoca, 00746_rhoca, 00949_rhoca, 01216_rhoca, 
01374_rhoca, 01523_rhoca, 01524_rhoca, 02710_rhoca, 03530_rhoca, 4825_rhosp, 
BIL2_rhosp, BELl_silpo, BIL2_silpo, BDL3_silpo, BIL4_silpo, BIL5_silpo, 
BIL6_silpo, BIL7_silpo, BIL8_silpo, BIL9_silpo, BIL10_silpo, BILll_silpo, 
BIL12_silpo, BEL13_silpo, BIL14_silpo, BIL15_silpo, BIL16_silpo, Bill_rhile, 
BILl unknwn, and BIL2_unknwn. 

113. The method of claim 106, wherein said autoprocessing segment is 
derived from a protein of an organism belonging to a genus selected from the group 
consisting of Brucella, Clostridium, Magnetospirillum, Mannheimia, 
Methylobacterium, Neisseria, Pseudomonas, Rhodobacter, Silicibacter, Streptomyces, 

* 

Thermobifida, Rhizobium, Chromobacterium, Myxococcus, Leptospira, Pirellulaj 
Gemmata, Gloeobacter and Verrucomicrobium. 

* 

114. The method of claim 113, wherein said organism is selected from the 
group consisting of Rhodobacter capsulatus, Rhodobacter sphaeroides, Silicibacter 
pomeroyi, Brucella melitensis, Brucella suis, Magnetospirillum magnetotacticum, 
Methylobacterium extorquens, Rhizobium leguminosarum, Neisseria meningitidis, 
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Neisseria meningitidis, Neisseria meningitidis, Neisseria gonorrhoeae, 
Chromobacterium violaceum, Pseudomonas syringae, Pseudomonas fluorescens, 
Pseudomonas fluorescens, Mannheimia haemolytica, Myxococcus xanthus, 
Leptospira interrogans, Streptomyces coelicolor, Streptomyces avennitilis, 
Thermobifida fusca, Clostridium thermocellum, Pirellula species 1, Gemmata 
obscuriglobus, Gloeobacter violaceus, and Verrucomicrobium spinosum. 

* 

115. The method of claim 106, wherein said auto-cleavage results in 
removal of a segment of said chimeric polypeptide adjacent to an amino terminal end 
or a carboxy terminal end of said autoprocessing segment. 

116. The method of claim 115, wherein said segment of said chimeric 
polypeptide adjacent to said autoprocessing segment is an amino terminal segment or 
a carboxy terminal segment of said chimeric polypeptide. 

117. The method of claim 115, wherein said segment of said chimeric 
polypeptide adjacent to said carboxy terminal end of said autoprocessing segment 
includes an amino acid residue comprising a nucleophilic group at an amino terminal 
end thereof. 



118. The method of claim 117, wherein said nucleophilic group is a 
hydroxyl group. 



119. The method of claim 117, wherein said amino acid residue is a 
threonine residue. 



120. The method of claim 115, wherein said segment of said chimeric 
polypeptide adjacent to said amino terminal end of said autoprocessing segment 
includes a serine amino acid residue at a carboxy terminal end thereof. 

121. The method of claim 106, wherein said chimeric polypeptide is capable 
of said auto-cleavage under a condition selected from the group consisting of a 
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temperature selected from a range of 33 °C to 41 °C, a pH selected from a range of pH 
7.8 to pH 8.2, and a concentration of dithiothreitol selected from a range of 0.1 mM to 
20 mM. 
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Fig. 4a 




A2115+ nei 
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Fig. 4b 
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Fig. 5a 



mw - MBC MC 




MBP-PsyBIL-CBD (66.3 kDa) 
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Expressed protein: 
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Fig. 9 
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Fig. 10a 



Fig. 10b 



Fig. 10c 
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Fig. 12a 



Fig. 12b 



Anti-C 



Anti-M 



Total 



MC 



(~ 50 kDa) 





Total 





MB 

MC 

(~ 50 kDa) 
M 



Fig. 12c 



Total 

lit 




!:i5:.i-..'v J .1 




* i rf ( r*. \ 
I « ft, ' 



: a->c 

■v. s v.;; 



t. ■ 



. ■ » 

1 



»-.(. . . 



:<.,- "... • - J:- , • 

»-"■*»• j.,Vwj4f ■* ; " i 
■ '■ 



I: ' 4 




j' Vr, ■ . ! 



! 



- :« - ■■ *■ 




i ' : 

V * > J ' - ' Ol 



l- 
i 



In 



Control 
(no BIL) 



5 ■. Vr.» : ■ ^ 




■ ■,•4'.' • -I 



■j ! 



* : * . v ' - -.1 



WO 2004/043997 



1 

SEQUENCE LISTING 

<110> Pietrokovski, Shmuel 
Amitai, Gil 

<120> CHIMERIC AUTOPROCESSING POLYPEPTIDES AND USES THEREOF 
<130> 26577 
<160> 109 

<170> Patentln version 3.2 

<210> 1 

<211> 25 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 1 

aaatgtcgac tgcggtggcc tgacc 

<210> 2 

<211> 25 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 2 

tgtcgtattg cttcctttcg ggctt 

<210> 3 

<211> 33 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 3 

tgcggtggcc tgaccggtct gaactcaggc etc 



<210> 4 

<211> 29 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 4 

aaaaggatcc tgetttgegg ceggaaega 

<210> 5 

<211> 32 

<212> DNA 

<213> artificial sequence 
• <220> 

<223> Single strand DNA oligonucleotide 



<400> 5 

aaaatctaga ggtattatgc acccatgtct tg 
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2 

<210> 6 

<211> 27 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 



<400> 


6 


gaattcggtg attcatcctt ggggcga 


<210> 


7 


<211> 


28 


<212> 


DNA 


<213> 


artificial sequence 


<220> 




<223> 


Single strand DNA oligonucleotide 


<400> 


7 



tctagaaaaa cacggcaagg gcgagcgg 



<210> 8 

<211> 143 

<212> PRT 

<213> Neisseria meningitidis 



<220> 

<221> misc_feature 

<222> (36).. (62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> miscjleature 

<222> (78).. (80) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (110) . . (113) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (129) . . (135) 

<223> Xaa can be any naturally occurring amino acid 

<400> 8 

Ser Phe His Gly Ser Thr Leu Val Lys Thr Ala Asp Gly Tyr Lys Ala 
1 5 10 15 



lie Ala His He Gin Ala Gly Asp Arg Val Phe Ala Lys Asp Glu Thr 

20 25 30 



Ser Gly Lys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Asn 
50 55 60 




Gin Thr Leu He Ser Asn Lys He His Pro Phe Tyr Ser Xaa Xaa Xaa 
65 70 75 80 
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Tip He Gin Ala Gly Arg Leu Lys Lys Gly Asp Thr Leu Leu Ser Glu 

85 90 " 95 



Ser Gly Ala Lys Gin Thr Val Gin Asn He Thr Leu Lys Xaa Xaa Xaa 

100 105 ~ 110 



Xaa Lys Ala Tyr Asn Leu Thr Val Ala Asp Trp His Thr Tyr Phe Val 
115 120 125 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Gly Val Trp Val His Asn Glu 
130 135 140 



<210> 9 

<211> 143 

<212> PRT 

<213> Neisseria meningitidis (group B) 



<220> 

<221> misc_feature 

<222> (36).. (62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (78).. (80) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (110) . . (113) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (129) . . (135) 

<223> Xaa can be any naturally occurring amino acid 

<400> 9 

Ser Phe His Gly Ser Thr Leu Val Lys Thr Ala Asp Gly Tyr Lys Ala 
1 5 10 15 



He Ala Arg He Arg Thr Gly Asp Arg Val Phe Ala Lys Asp Glu Ala 

20 25 30 



Ser Gly Lys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Asn 
50 55 60 



Gin Thr Leu He Ser Asn Lys He His Pro Phe Tyr Ser Xaa Xaa Xaa 
65 70 75 80 



Trp He Glri Ala Gly Arg Leu Lys Lys Gly Asp Thr Leu Leu Ser Glu 

85 90 95 



Ser Gly Ala Lys Gin Thr Val Gin Asn He Thr Leu Lys Xaa Xaa Xaa 

100 105 110 



WO 2004/043997 



* 



Xaa Lys Ala Tyr Asn Leu Thr Val Ala Asp Trp His Thr Tyr Phe Val 
115 120 * 125 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Gly Val Trp Val His Asn Asp 
130 135 140 



<210> 10. 

<211> 143 

<212> PRT 

<213> Neisseria meningitidis (group B) 
<220> 

<221> misc_f eature 

<222> (36).. (62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (78).. (80) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (110) . . (113) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (129) . . (135) 

<223> Xaa can be any naturally occurring amino acid 

<400> 10 

Ser Phe His Gly Ser Thr Leu Val Lys Thr Ala Asp Gly Tyr Lys Ala 
1 5 10 15 



lie Ala Arg lie Arg Thr Gly Asp Arg Val Phe Ala Lys Asp Glu Ala 

20 25 30 



Ser Gly Lys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Asn 
50 55 60 



Gin Thr Leu lie Ser Asn Lys lie His Pro Phe Tyr Ser Xaa Xaa Xaa 
65 70 75 80 



Trp He Gin Ala Gly Arg Leu Lys Lys Gly Asp Thr Leu Leu Ser Glu 

85 90 95 



Ser Gly Ala Lys Gin Thr Val Gin Asn He Thr Phe Lys Xaa Xaa Xaa 

100 105 lib 



Xaa Lys Ala Tyr Asn Leu Thr Val Ala Asp Trp His Thr Tyr Phe Val 
115 120 125 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Gly Val Trp Val His Asn Asp 
130 a35 140 



<210> 11 
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<211> 142 
<212> PRT 

<213> Magnetospirillum magnetotacticum 



<220> 

<221> misc — feature 

<222> (14) . . (14) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature . 

<222> (37) . (53) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (69) . . (85) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (101) . . (119) 

<223> Xaa can be any naturally occurring amino acid 

<400> 11 

Cys Phe Val Ala Gly Thr Pro Val Arg Met Ala Asp Gly Xaa Glu Lys 
1 5 10 15 



Ala lie Glu Thr Val Glu lie Gly Glu Gin Val Gin Gly Thr Asp Gly 

20 25 30 

Thr lie Asn Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Asn Ser Leu Asp Phe Phe Val Thr Ala Asp His 
50 55 60 



Pro Phe Leu Thr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
65 70 75 80 



Xaa Xaa Xaa Xaa Xaa Ala Leu Asn Val Thr Gin Leu Val lie Gly Asp 

85 90 95 



Thr Leu He Thr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

100 105 110 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val Val Tyr Asn Leu His Leu He Gly 
115 120 125 



Asn Asn Thr Tyr Val Ala Ser Gly Tyr Tyr Val His Asn Tyr 
130 135 140 



<210> 12 

<211> 149 

<212> PRT 

<213> Pseudomonas syringae 



<220> 

<221> misc_feature 

<222> (36).. {66) 

<223> Xaa can be any naturally occurring amino acid 
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<220> 

<221> misc__f eature 

<222> (82).. (86) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc^f eature 

<222> (102).. (124) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc^f eature 

<222> (140) . . (141) 

<223> Xaa can be any naturally occurring amino acid 

<400> 12 

Cys Phe Ala Ala Gly Thr Met Val Ser Thr Pro Asp Gly Glu Arg Ala 
15 10 15 



lie Asp Thr Leu Lys Val Gly Asp lie Val Trp Ser Lys Pro Glu Gly 

20 25 30 



Gly Gly Lys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
50 55 60 



Xaa Xaa Glu Asp Glu Ser Leu Leu Val Thr Pro Gly His Pro Phe Tyr 
65 70 75 80 



Val Xaa Xaa Xaa Xaa Xaa Phe Val Pro Val lie Asp Leu Lys Pro Gly 

85 90 95 



Asp Arg Leu Gin Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

100 105 110 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Lys Thr Tyr Asn 
115 120 125 



Leu Thr Val Asp Val Gly His Thr Phe Tyr Val Xaa Xaa Leu Lys Thr 
130 135 140 



Trp Val His Asn Thr 
145 



<210> 13 

<211> 149 

<212> PRT 

<213> Pseudomonas fluorescens 



<220> 

<221> misc_f eature 

<222> (36).. (66) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (82) . . (86) 

<223> Xaa can be any naturally occurring amino acid 
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<220> 

<221> misc_feature 

<222> (102) . . (124) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc__f eature 

<222> (140) . . (141) 

<223> Xaa can be any naturally occurring amino acid 

<400> 13 

* » 

Cys Phe Ala Ala Gly Thr Met Val Ala Thr Pro Lys Gly Glu Arg Ala 
15 10 15 



lie Glu Thr Leu Lys lie Gly Asp Val Val Trp Ser Lys Pro Glu Gin 

20 25 30 



Gly Gly Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
50 55 60 



Xaa Xaa Ser Ser Glu Thr Leu Glu Val Thr Pro Gly His Pro Phe Tyr 
65 70 75 80 



Val Xaa Xaa Xaa Xaa Xaa Phe Val Pro Leu He Glu Leu Gin Pro Gly 

85 90 95 



Asp Arg Leu Gin Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

100 105 110 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Arg Thr Tyr Asn 
115 120 125 



Leu Thr Val Asp He Gly His Thr Phe Tyr Val Xaa Xaa Leu Gly Thr 
130 135 140 



Trp Val His Asn Val 
145 



<210> 14 

<211> 149 

<212> PRT 

<213> Pseudomonas fluorescens 



<220> 

<221> misc__feature 

<222> (36)7! (66) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> jnisc_f eature 

<222> (82).. (86) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<;221> mis cofeature 

<222> (102) . . (124) 

<223> Xaa can be any naturally occurring amino acid 
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<220> 

<221> misc_feature 
<222> (140) . . (141) 

<223> Xaa can be any naturally occurring amino acid 
<400> 14 

Cys Phe Ala Ala Gly Thr Met Val Ala Thr Pro Ser Gly -Asp Arg Ala 
15 10 15 



lie Asp Thr :Leu Lys Val Gly Glu lie Val Trp Ser Lys Pro Glu His 

20 25 30 



Gly Gly Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
50 55 60 



Xaa Xaa Glu Gly Glu Thr Leu Leu Val Thr Pro Ser His Pro Phe Tyr 
65 70 75 80 



Val Xaa Xaa Xaa Xaa Xaa Phe Val Pro Ala lie Asn Leu Lys Pro Gly 

85 90 95 



Asp Leu Leu Gin Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

100 105 110 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Lys Thr Phe Asn 
115 120 125 



Leu Thr Val Asp lie Gly His Thr Phe Tyr Val Xaa Xaa Leu Lys Thr 
130 135 140 



Trp Val His Asn Thr 
145 



<210> 15 

<211> 140 

<212> PRT 

<213> Streptomyces coelicolor 



<220> 

<221> misc_feature 

<222> (14).. (14) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> mis cofeature 

<222> (37),. (60) 

<223> Xaa can be any ' naturally occurring amino. acid 
<220> 

<221> misc_feature 

<222> (76) . . (80) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (110) . . (113) 

<223> Xaa can be any naturally occurring amino acid 
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<220> 

<221> misc_feature 
<222> (129) . . (132) 

<223> Xaa can be any naturally occurring amino acid 
<400> 15 

Ser Phe Pro Ala Gly Thr Arg Val Leu Met Ala Asp Gly Xaa Arg Arg 
15 10 15 



Ser lie Glu ,Gln lie Glu Ala Gly Asp Leu Val Thr Ala Thr Asp Pro 

20 25 30 



Thr Thr Gly Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asp Gly Ser Thr 
50 55 60 



Leu Thr Ser Thr Thr His His Pro Tyr Trp Ser Xaa Xaa Xaa Xaa Xaa 
65 70 75 80 



Trp Lys Asn Ala Gly Asp Leu Glu Ala Gly Asp Thr Leu Arg Thr Pro 

85 90 95 



Gin Asn Thr Ala Val Val lie Ala Ala Thr His Asp Trp Xaa Xaa Xaa 

100 105 110 



Xaa Asp Ala Tyr Asp Leu Thr Val Asp Gly Phe His Ser Tyr Tyr Val 
115 120 125 



Xaa Xaa Xaa Xaa Thr Asp Val Leu Val His Asn Asn 
130 135 140 



<210> 16 

<211> 145 

<212> PRT 

<213> Thermobifida fusca 



<220> 

<221> misc_feature 

<222> (14) . . (14) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (37) . . (64) 

<;223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (80) . . (84) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_ feature 

<222> (114) . . (118) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (134) . . (137) 
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<223> Xaa can be any naturally occurring amino acid 
<400> 16 

Ser Phe Val Pro Gly Thr Leu Val Leu Leu Ala Asp Gly Xaa Tyr Ala 
15 io 15 



Pro He Glu Thr He Thr Val Gly Asp Asp Val Trp Ala Phe Asp Pro 

20 25 30 



Arg Thr Gly ; Thr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
50 55 60 



His Gly Gly Val Val Val Ala Thr Asp Ala His Pro Phe Trp Val Xaa 
65 70 75 80 



Xaa Xaa Xaa Xaa Trp Val Ala Ala He Asp Leu Glu Pro Gly Thr Trp 

85 90 95 



Leu Arg Thr Ser Ala Gly Thr Trp Val Gin Val Arg Ala Val Ala Val 

100 105 110 



Arg Xaa Xaa Xaa Xaa Xaa Arg Val His Asn Leu Thr Val Ala Asp Leu 
115 120 125 



His Thr Tyr Tyr Val Xaa Xaa Xaa Xaa Ala Asp Ala Leu Val His Asn 
130 135 140 



Glu 
145 



<210> 17 

<211> 130 

<212> PRT 

<213> Neisseria gonorrhoeae 



<220> 

<221> misc feature 

<222> (23)7. (49) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (65) . . (67) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<22l> mis cofeature 

<222> (97) . . (100) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (116) . . (122) 

<223> Xaa can be any naturally occurring amino acid 

<400> 17 
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Tyr Lys Ala lie Ala His lie Gin Ala Gly Asp Arg Val Leu Ser Lys 
15 10 15 



Asp Glu Ala Ser Gly Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

20 25 30 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Asn Ser ,-Gln Thr Leu lie Ser Asn Arg lie His Pro Phe Tyr Ser 
50 55 60 



Xaa Xaa Xaa Trp lie Lys Ala Glu Asp Leu Lys Ala Gly Ser Arg Leu 
65 70 75 80 



Leu Ser Glu Ser Gly Lys Thr Gin Thr Val Arg Asn lie Val Val Lys 

85 90 95 



Xaa Xaa Xaa Xaa Lys Ala Tyr Asn Leu Thr Val Ala Asp Trp His Thr 

100 105 110 



Tyr Phe Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Gly Val Trp Val His 
115 120 125 



Asn Asp 
130 



<210> 18 

<211> 143 

<212> PRT 

<213> Neisseria gonorrhoeae 



<220> 

<221> misc_f eature 

<222> (36).. (62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (78).. (80) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (110) . . (113) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (129) . . (135) ' 

<223> Xaa can be any naturally occurring amino acid 

<400> 18 

Pro Phe His Gly Ser Thr Leu Val Lys Thr Ala Asp Gly Tyr Lys Ala 
1 5 10 15 



lie Ala Arg He Arg Val Gly Asp His Val Phe Ala Lys Asp Glu Ala 

. 20 25 30 
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Ser Gly Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Asn 
50 55 60 



Gin Thr Leu lie Ser Asn Arg lie His Pro Phe Tyr Ser Xaa Xaa Xaa 
65 70 75 80 



Trp lie Lys .Ala Glu Asp Leu Lys Ala Gly Ser Arg Leu Leu Ser Glu 

85 90 95 



Ser Gly Arg Thr Gin Thr Val Arg Asn lie lie Val Lys Xaa Xaa Xaa 

100 105 110 



Xaa Lys Ala Tyr Asn Leu Thr Val Ala Asp Trp His Thr Tyr Phe Val 
115 120 125 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Gly Val Trp Val His Asn Ala 
130 135 140 



<210> 19 

<211> 143 

<212> PRT 

<213> Neisseria gonorrhoeae 
<220> 

<221> misc_feature 

<222> (36) . . (62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_ feature 

<222> (78) . . (80) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (110) . . (113) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc__feature 

<222> (129) . . (135) 

<223> Xaa can be any naturally occurring amino acid 

<400> 19 

Ser Phe His Gly Ser Thr Leu Val Arg Thr Ala Asp Gly Tyr Lys Ala 
15 10 ' 15 



He Ala His He Gin Ala Gly Asp Arg Val Leu Ser Lys Asp Glu Ala 

20 25 30 



Ser Gly Lys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 '45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Ser 
50 55 60 
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Gin Thr Leu lie Ser Asn Arg lie His Pro Phe Tyr Ser Xaa Xaa Xaa 
65 70 75 80 



Trp lie Lys Ala Glu Asp Leu Lys Ala Gly Asn Arg Leu Phe Ala Glu 

85 90 95 



Ser Gly Lys Thr Gin Thr Val Arg Asn He Val Val Lys Xaa Xaa Xaa 

100 105 110 



Xaa Lys Ala ;Tyr Asn Leu Thr Val Ala Asp Trp His Thr Tyr Phe Val 
115 120 * 125 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Gly Val Trp Val His Asn Ser 
130 135 140 



<210> 20 

<211> 143 

<212> PRT 

<213> Neisseria gonorrhoeae 



<220> 

<221> misc_feature 

<222> (36) . . (62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (78) . . (80) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc__feature 

<222> (110) . . (113) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (129) . . (135) 

<223> Xaa can be any naturally occurring amino acid 

<400> 20 

Ser Phe His Gly Ser Thr Leu Val Lys Thr Ala Asp Gly Tyr Lys Ala 
1 5 10 15 



He Ala His He Gin Ala Gly Asp Arg Val Leu Ser Lys Asp Glu Ala 

20 25 30 



Ser Gly Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 .45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Ser 
50 55 . 60 



Gin Thr Leu He Ser Asn Arg He His Pro Phe Tyr Ser Xaa Xaa Xaa 
65 70 75 80 



Trp He Lys Ala Glu Asp Leu Lys Ala Gly Ser Arg Leu Leu Ser Glu 

85 90 95 
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Ser Gly Lys Thr Gin Thr Val Arg Asn He Val Val Lys Xaa Xaa Xaa 

100 105 ^ 110 



Xaa Lys Ala Tyr Asn Leu Thr Val Ala Asp Trp His Thr Tyr Phe Val 
115 120 125 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Gly Val Trp Val His Asn Asp 
130 135 140 



<210> 21 

<211> 143 

<212> PRT 

<213> Neisseria gonorrhoeae 



<220> 

<221> miscjFeature 

<222> (36) . . (62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (78) . . (80) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (110).. (113) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (129) . . (135) 

<223> Xaa can be any naturally occurring amino acid 

<400> 21 

Ser Phe His Gly Ser Thr Leu Val Lys Thr Ala Asp Gly Tyr Lys Ala 
1 5 10 15 

He Ala His lie Gin Ala Gly Asp Arg Val Leu Ser Lys Asp Glu Ala 

20 25 * 30 

Ser Gly Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Ser 
50 55 60 



Gin Thr Leu He Ser Asn Arg He His Pro Phe Tyr Ser Xaa Xaa Xaa 
65 70 75 80 

Trp He Lys Ala Glu Asp Leu Lys Ala Gly Ser Arg Leu Phe Ala Glu 

85 90 95 

Ser Gly Lys Thr Gin Thr Val Arg Asn He He Val Lys Xaa Xaa Xaa 

100 105 110 

Xaa Lys Ala Tyr Asn Leu Thr Val Ala Asp Trp His Thr Tyr Phe Val 
115 120 125 
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Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Gly val Trp Val His flsn Asp 
130 135 140 



<210> 22 

<211> 143 

<212> PRT 

<213> Neisseria gonorrhoeae 



<220> 

<221> misc_ feature 

<222> (36) ... (62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc — feature 

<222> (78) . . (80) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_ feature 

<222> (110) . . (113) 

<223> Xaa can be any naturally occurring amino acid 

<220> " 
<22l> misc_feature 
<222> (129) . . (135) 

<223> Xaa can be any naturally occurring amino acid 
<400> 22 

Pro Phe His Gly Ser Thr Leu Val Lys Thr Ala Asp Gly Tyr Lys Ala 
15 10 15 

He Ala His He Gin Thr Gly Glu His Val Phe Ala Lys Asp Glu Thr 

20 25 30 

Ser Gly Lys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Ser 
50 55 60 



Gin Thr Leu He Ser Asn Arg He His Pro Phe Tyr Ser Xaa Xaa Xaa 
65 70 75 80 

Trp lie Lys Ala Glu Asp Leu Lys Ala Gly Ser Arg Leu Leu Ser Glu 

85 90 " 95 

Ser Gly Arg Thr Gin Thr Val Arg Asn Thr Val Val Lys Xaa Xaa Xaa 

100 105 110 

Xaa Lys Ala Tyr Asn Leu Thr Val Ala Asp Trp His Thr Tyr Phe Vpl 
115 120 125 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Gly Val Trp Val His Asn Ser 
130 135 140 



<210> 23 

<211> 143 

<212> PRT 

<213> Neisseria meningitidis (group C) 
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<220> 

<221> misc_feature 

<222> (36).. (62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (78).. (80) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> mis cofeature 

<222> (110) . . (113) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_ feature 

<222> (129) . . (135) 

<223> Xaa can be any naturally occurring amino acid 

<400> 23 

Ser Phe His Gly Ser Thr Leu Val Lys Thr Ala Asp Gly Tyr Lys Ala 
1 5 10 15 

He Ala His He Arg Val Gly Glu Ser Val Phe Ala Lys Asp Glu Thr 

20 25 " 30 



Ser Gly Lys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Ser 
50 55 60 



Gin Thr Leu He Ser Asn Arg lie His Pro Phe Tyr Ser Xaa Xaa Xaa 
65 70 75 80 

Trp He Gin Ala Gly Arg Leu Lys Lys Gly Asp Thr Leu Leu Ser Glu 

85 90 95 



Ser Gly Ala Lys Gin Thr Val Gin Asn He Thr Leu Lys Xaa Xaa Xaa 

100 105 " HO 



Xaa Lys Ala Tyr Asn Leu Thr Val Ala Asp Trp His Thr Tyr Phe Val 
115 120 125 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Gly Val Trp Val His Asn Asp 
130 135 140 



<210> 24 

<211> 143 

<212> PRT 

<213> Neisseria meningitidis (group C) 



<220> 

<221> misc_feature 

<222> (36) . . (62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> " misc_feature 

<222> (78).. (80) 



WO 2004/043997 




17 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 
<222> (110) . . (113) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 
<222> (129) . . (135) 

<223> Xaa can be any naturally occurring amino acid 
<400> 24 

Pro Leu Tyr Val Gly Ala Leu Val Lys Thr Ala Asp Gly Tyr Lys Ala 
15 10 15 



He Ala His He Arg Val Gly Glu Ser Val Leu Ser Lys Asp Glu Ala 

20 25 30 



Ser Gly Lys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Ser 
50 55 60 



Gin Thr Leu He Ser Asn Arg He His Pro Phe Tyr Ser Xaa Xaa Xaa 
65 70 75 80 



Trp He Gin Ala Gly Arg Leu Lys Lys Gly Asp Thr Leu Leu Ser Glu 

85 90 95 



Ser Gly Ala Lys Gin Thr Val Gin Asn He Thr Phe Lys Xaa Xaa Xaa 

100 105 110 



Xaa Lys Ala Tyr Asn Leu Thr Val Ala Asp Trp His Thr Tyr Phe Val 
115 120 125 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Gly Val Trp Val His Asn Ala 
130 135 140 



<210> 25 

<211> 143 

<212> PRT 

<213> Neisseria meningitidis (group C) 



<220> 

« 

<221> misc_f eature 

<222> (36).. (62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (78).. (80) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (110) . . (113) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc feature 
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<222> (129) . . (135) 

<223> Xaa can be any naturally occurring amino acid 
<400> 25 

Ser Phe His Gly Ser Thr Leu Val Lys Thr Ala Asp Gly Tyr Lys Ala 
15 10 15 



lie Ala His lie Arg Val Gly Glu Ser Val Leu Ser Lys Asp Glu Ala 

20 25 30 



Ser Gly Lys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Ser 
50 55 60 



Gin Thr Leu Val Ser Asn Lys He His Pro Phe Tyr Ser Xaa Xaa Xaa 
65 70 75 80 



Trp He Lys Ala Glu Asp Leu Lys Ala Gly Ser Arg Leu Leu Ser Glu 

85 90 95 



Ser Gly Lys Thr Gin Thr Val Arg Asn He Val Val Lys Xaa Xaa Xaa 

100 105 110 



Xaa Lys Ala Tyr Asn Leu Thr Val Ala Asp Trp His Thr Tyr Phe Val 
115 120 125 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Gly Val Trp Val His Asn Ala 
130 135 140 



<210> 26 

<211> 143 

<212> PRT 

<213> Neisseria meningitidis {group C) 



<220> 

<221> misc_feature 

<222> (36).. (62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (78).., (80) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc feature 

<222> (1107..(H3) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (129) . . (135) 

<223> Xaa can be any naturally occurring amino acid 

<400> 26 



Ser Phe His Gly Ser Thr Leu Val Lys Thr Ala Asp Gly Tyr Lys Ala 
15 10 15 
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He Ala Arg He Arg Thr Gly Asp Arg Val Phe Ala Lys Asp Glu Ala 

20 25 30 



Ser Gly Lys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Asn 
50 55 60 



Gin Thr Leu : Ile Ser Asn Lys He His Pro Phe Tyr Ser Xaa Xaa Xaa 
65 70 75 80 



Trp He Gin Ala Gly Arg Leu Lys Lys Gly Asp Thr Leu Leu Ser Glu 

85 90 95 



Ser Gly Ala Lys Gin Thr Val Gin Asn He Thr Phe Lys Xaa Xaa Xaa 

100 105 110 



Xaa Lys Ala Tyr Asn Leu Thr Val Ala Asp Trp His Thr Tyr Phe Val 
115 120 125 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Gly Val Trp Val His Asn Ala 
130 135 140 



<210> 27 

<211> 143 

<212> PRT 

<213> Neisseria meningitidis (group C) 
<220> 

<221> misc_f eature 

<222> (36) . . (62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<22l> misc_jf eature 

<222> (78) . . (80) ' 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (110) . . (113) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc f eature 

<222> (129) . . (135) 

<223> Xaa can be any naturally occurring amino acid 

<400> 27 

Ser Phe His Gly Ser Thr Leu Val Lys Thr Ala Asp Gly Tyr Lys Ala 
1 5 10 " J "* 15 

He Ala His He Gin Ala Gly Asp Arg Val Leu Ser Lys Asp Glu Ala 

20 25 ^ 30 



Ser Gly Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 
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Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Ser 
50 55 60 



Gin Thr Leu Val Ser Asn Lys lie His Pro Phe Tyr Ser Xaa Xaa Xaa 
65 70 75 80 



Trp lie Gin Ala Gly Arg Leu Lys Lys Gly Asp Thr Leu Leu Ser Glu 

85 90 95 



Ser Gly Ala ; Lys Gin Thr Val Gin Asn He Thr Leu Lys Xaa Xaa Xaa 

100 105 110 



Xaa Lys Ala Tyr Asn Leu Thr Val Ala Asp Trp His Thr Tyr Phe Val 
115 120 125 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Gly Val Trp Val His Asn Ser 
130 135 140 



<210> 28 

<211> 164 

<212> PRT 

<213> Mannheimia haemolytica 



<220> 

<221> misc_feature 

<222> -(36).. (62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (78).. (101) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (131) . . (134) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (150) . . (156) 

<223> Xaa can be any naturally occurring amino acid 

<400> 28 

Ser Phe His Gly Asp Met Glu Val Lys Thr Asp Lys Gly Tyr Arg Gin 
1 5 10 15 



He Ser Ser He Lys Val Gly Asp Lys Val Leu Ala Lys Asn Glu Arg 

20 25 30 



Thr Gly He Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Lys Tyr 
50 55 60 



His Thr He Val Ser Asn Lys He His Pro Phe Phe Thr Xaa Xaa Xaa 
65 70 75 80 
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Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

85 90 95 



Xaa Xaa Xaa Xaa Xaa Trp Val Asp Ala Gin His Leu Gin Lys Gly Tyr 

100 105 110 



Arg Leu Leu Ala Glu Ser Gly Glu Trp Gin Thr Val Thr Lys Val Lys 
115 120 125 



lie Lys Xaa Xaa Xaa Xaa Lys Ala Tyr Asn Met Thr Val Glu Lys Asp 
130 135 140 



His Thr Tyr Phe lie Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Gly Val Trp 
145 * 150 155 160 



Val His Asn Asp 



<210> 29 

<211> 136 

<212> PRT 

<213> Clostridium thermocellum 



<220> 

<221> misc_feature 

<222> (36) . . (57) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc__f eature 

<222> (73) . . (76) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (106).. (Ill) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_£ eature 

<222> (127) . . (128) 

<223> Xaa can be any naturally occurring amino acid 

<400> 29 

Cys Phe Val Ala Gly Thr Leu He Leu Thr Val Ala Gly Leu Val Ala 
1 5 .10 15 



He Glu Asn He Lys Ala Gly Asp Lys Val He Ala Thr Asn Leu Glu 

20 " 25 30 



Thr Phe Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Gly Glu Val He Lys Thr 
50 55 60 



Thr Phe Glu His Pro Phe Tyr Val Xaa Xaa Xaa Xaa Phe Val Glu Ala 
65 70 75 80 
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Lys Glu Leu Gin Val Gly Asp Lys Leu Leu Asp Ser Lys Gly Asn Val 

85 90 95 



Leu Val Val Glu Glu Lys Lys Leu Glu Xaa Xaa Xaa Xaa Xaa Xaa Lys 

100 105 110 



Yal Tyr Asn Phe His Val Asp Asp Phe Tyr Thr Tyr His Val Xaa Xaa 
115 120 125 



Asn Gly He leu Val His Asn Ala 
130 135 



<210> 30 

<211> 136 

<212> PRT 

<213> Clostridium thermocellum 



<220> 

<221> misc_feature 

<222> (36) . . (57) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc__f eature 

<222> (73) . . (76) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (106) . . (Ill) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (127) . . (128) 

<223> Xaa can be any naturally occurring amino acid 

<400> 30 

Cys Phe Val Ala Gly Thr Met Val Leu Thr Ala Ala Gly Leu Val Ala 
15 10 15 



He Glu Asn He Lys Val Gly Asp Lys Val He Ala Ala Asn Pro Glu 

20 ,25 30 



Thr Phe Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Gly Glu Val He Lys Thr 
50 55 60 



Thr Val Asp His Pro Phe Tyr Val Xaa Xaa Xaa Xaa Phe Val Glu Ala 
65 70 75 80 



Val Asn Leu Gin Val Gly Asp Lys Leu Val Asp Ser Lys Gly Asn Val 

85 90 95 



Leu Val Val Glu Glu Lys Lys Leu Lys Xaa Xaa Xaa Xaa Xaa Xaa Lys 

100 105 110 
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Val Tyr Asn Phe Lys Val Asp Asp Phe His Thr Tyr His Val Xaa Xaa 
115 120 125 



Lys Gly lie Leu Val His Asn Ala 
130 135 



<210> 31 

<211> 136 

<212> PRT 

<213> Clostridium thermocellum 



<220> 

<221> misc_feature 

<222> (36).. (57) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (73).. (76) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (106) . . (Ill) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc — feature 

<222> (127) . . (128) 

<223> Xaa can be any naturally occurring amino acid 

<400> 31 

Cys Phe Val Ala Gly Thr Met lie Leu Thr Ala Thr Gly Leu Val Ala 
1 5 10 15 

lie Glu Asn lie Lys Ala Gly Asp Lys Val He Ala Thr Asn Pro Glu 

20 25 30 



Thr Phe Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Gly Glu Val He Lys Thr 
50 55 " 60 



Thr Phe Asp His Pro Phe Tyr Val Xaa Xaa Xaa Xaa Phe Val Glu Ala 
65 70 75 80 



Gly Lys Leu Gin Val Gly Asp Lys Leu Leu Asp Ser Arg Gly Asn Val 

85 90 95 



Leu Val Val Glu Glu Lys Lys Leu Glu Xaa Xaa Xaa Xaa Xaa Xaa Lys 

100 105 HO 



Val Tyr Asn Phe Lys Val Asp Asp Phe His Thr Tyr His Val Xaa Xaa 
115 120 125 



Ash Glu Val Leu Val His Asn Ala 
130 135 



<210> 32 
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<211> 136 
<212> PRT 

<213> Clostridium thermocellum 



<220> 

<221> misc_feature 

<222> (36) . . (57) 

<223> Xaa can be any -naturally occurring amino acid 
<220> 

<221> misc_ feature 

<222> (73) , .(76) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (106) . . (Ill) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (127) . . (128) 

<223> Xaa can be any naturally occurring amino acid 

<400> 32 

Cys Phe Val Ala Gly Thr Met He Leu Thr Thr Thr Gly Leu Val Ala 
1 5 10 15 



He Glu Asn He Lys Ala Gly Asp Lys Val He Ala Thr Asn Pro Glu 

20 25 30 



Thr Phe Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Gly Glu Val He Lys Thr 
50 55 60 



Thr Phe Asp His Pro Phe Tyr Val Xaa Xaa Xaa Xaa Phe Val Glu Ala 
65 70 75 80 



Lys Gin Leu His Val Gly Asp Lys Leu Leu Asp Ser Lys Gly Asn Val 

85 90 " 95 



Leu Val Val Glu Asp Lys Lys He Lys Xaa Xaa Xaa Xaa Xaa Xaa Lys 

100 105 110 



Val Tyr Asn Phe Gin Val Ala Asp Phe His Thr Tyr His Val Xaa Xaa 
115 120 125 



Asn Gly Val Leu Val His Asn Val 
130 135 



<210> 33 

<211> 136 

<212> PRT, 

<213> Clostridium thermocellum 



<220> 

<221> misc_feature 

<222> (36).. (57) 

<223> Xaa can be any naturally occurring amino acid 
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<220> 

<221> misc_feature 
<222> (73).. (76) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 
<222> (106).. (Ill) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_:feature 
<222> (127) . . (128) 

<223> Xaa can be any naturally occurring amino acid 
<400> 33 

Cys Phe Val Ala Gly Thr Met He Leu Thr Val Ala Gly Leu Val Ala 
15 10 15 

He Glu Asn He Lys Ala Gly Asp Lys Val He Ala Thr Asn Pro Glu 

20 25 30 



Thr Phe Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Gly Asp Val He Lys Thr 
50 55 60 



Thr Phe Glu His Leu Phe Tyr Ala Xaa Xaa Xaa Xaa Phe Val Glu Ala 
65 70 75 80 

Lys Glu Leu Gin Val Gly Asp Lys Leu Leu Asp Ser Lys Gly Asn Val 

85 90 95 

Leu Val Val Glu Asp Lys Lys He Lys Xaa Xaa Xaa Xaa Xaa Xaa Lys 

100 105 HO 



Val Tyr Asn Phe Gin Val Asp Asp Phe His- Thr. Tyr His Val Xaa Xaa 
115 120 ~ 125 



Asn Gly Val Leu Val His Asn Val 
130 135 



<210> 34 

<211> 136 

<212> PRT 

<213> Clostridium thermocellum 



<220> 

<221> misc feature 

<222> (36)7.(57) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

•<221> misc__f eature 

<222> (59).. (59) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc__f eature 

<222> (73) . . (76) 
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<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 
<222> (101) . . (101) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_£eature 
<222> (106) . . (Ill) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 
<222> (127) . . (128) 

<223> Xaa can be any naturally occurring amino acid 
<400> 34 

Cys Phe Val Ala Gly Thr Met lie Leu Thr Ala Thr Gly Leu Val Ala 
15 10 15 



lie Glu Asn lie Lys Ala Gly Asp Lys Val lie Ala Thr Asn Pro Glu 

20 25 30 



Thr Phe Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Glu lie lie Lys Thr 
50 55 "* 60 



Thr Leu Gly His Leu Phe Tyr Val Xaa Xaa Xaa Xaa Phe Val Glu Ala 
65 70 75 80 



Val Lys Leu Gin Pro Thr Asp Lys Leu Val Asp Ser Gly Gly Asn Val 

85 90 " 95 



Leu Val Val Glu Xaa Lys Lys Phe Glu Xaa Xaa Xaa Xaa Xaa Xaa Lys 

100 105 110 



Val Tyr Asn Phe Lys Val Asn Asp Phe Tyr Thr Tyr His Val Xaa Xaa 
115 120 125 



Asn Gly lie Leu Val His Asn Val 
130 135 



<210> 35 

<211> 136 

<212> PRT 

<213> Clostridium thermocellum 



<220> 

<221> miscjfeature 

<222> (36) . . (57) 

<223>' Xaa can be any naturally occurring amino acid 
<220> 

<221> mis cofeature 

<222> (59) . . (59) 

<223> Xaa can be any naturally occurring amino acid 

<220> . 

<221> misc feature 
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<222> (73) . . (76) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 
<222> (101) . . (101) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 
<222> (106) . . (Ill) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 
<222> (127) . . (128) 

<223> Xaa can be any naturally occurring amino acid 
<400> 35 

Cys Phe Val Ala Gly Thr Met lie Leu Thr Ala Thr Gly Leu Val Ala 
15 10 15 



lie Glu Asn He Lys Ala Gly Asp Lys Val He Ala Thr Asn Pro Glu 

20 25 30 



Thr Phe Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Glu He He Lys Thr 
50 ' 55 60 



Thr Leu Gly His Leu Phe Tyr Val Xaa Xaa Xaa Xaa Phe Val Glu Ala 
65 70 75 80 



Val Lys Leu Gin Pro Thr Asp Lys Leu Val Asp Ser Gly Gly Asn Val 

85 90 95 



Leu Val Val Glu Xaa Lys Lys Phe Glu Xaa Xaa Xaa Xaa Xaa Xaa Lys 

100 105 110 



Val Tyr Asn Phe Lys Val Asn Asp Phe Tyr Thr Tyr His Val Xaa Xaa 
115 120 125 



Asn Gly He Leu Val His Asn Val 
130 135 



<210> 36 

<211> 214 

<212> PRT 

<213> Leptospira interrogans 



<220> 

<221> misc^feature 

<;222> (36) . . (58) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (74) . . (151) 

<223> Xaa can be any naturally occurring amino acid 
<220> 
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<221> misc_feature 
<222> (181) . . (185) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 
<222> (201) . . (206) 

<223> Xaa can be any naturally occurring amino acid 
<400> 36 

Cys Phe Thr Ala Gly Ser Lys Val Thr Lys Leu Lys Asn Phe Ala Asn 
15 10 15 



lie Glu Glu lie Lys He Gly Asp He Val Arg Ser Trp Asn Glu Asn 

20 25 30 



Thr Asn Thr Xaa Xaa< Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Glu Glu Glu He His 
50 55 60 



Thr Thr Trp Asn His Pro Phe Arg Arg Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
65 70 75 80 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

85 90 95 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

100 105 110 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
115 120 125 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
130 135 140 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Trp Val Lys Val Glu Asp Leu Arg Leu 
145 150 155 " 160 



Lys Asp Gin Val Leu Arg Ser Asp Gly Ser Trp Gly Thr Val Thr Gly 

165 170 175 



He Tyr Tyr Tyr Xaa Xaa Xaa Xaa Xaa Lys Val Tyr Asn Leu Glu Val 

180 .185 190 



Glu Asp Asn His Thr Tyr Val Val Xaa Xaa Xaa Xaa Xaa Xaa He Gly 
195 200 205 



Tyr Val Val His Asn Tyr 
210 



<210> 37 

<211> 239 

<212> PRT 

<213> Leptospira interrogans 
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<220> 

<221> misc_feature 

<222> (14).. (42) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (65).. (87) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (103).. . (176) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> raisc_f eature 

<222> (206) . . (210) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (226) . . (231) 

<223> Xaa can be any naturally occurring amino acid 

<400> 37 

Cys Phe Val Ala Gly Ser Lys Val Thr Lys Leu Lys Asn Xaa Xaa Xaa 
1 5 10 15 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

20 25 30 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Ala Asn lie Glu Glu 
35 40 45 



He Arg He Gly Asp Val Val Arg Ser Trp Asn Glu Asn Thr Asn Thr 
50 55 60 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
65 70 75 80 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Glu Glu Glu He His Thr Thr Trp 

85 90 95 



Asn His Pro Phe Arg Arg Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

100 105 110 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
115 120 125 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
130 135 140 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
145 150 155 160 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

165 170 175 



Trp Val Lys Val Glu Asp Leu Arg Leu Arg Asp Gin Val Leu Arg Ser 

180 '185 190 
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Asp Gly Ser Trp Gly Thr Val Thr Gly He Tyr Tyr Tyr Xaa Xaa Xaa 
195 200 ' * 205 

Xaa Xaa Lys Val Tyr Asn Leu Glu Val Glu Asp Asn His Thr Tyr Val 
210 215 220 



Val Xaa Xaa Xaa Xaa Xaa Xaa Lys Gly Tyr Val Val His Asn Tyr 
225 230 235 



<210> 38 

<211> 144 

<212> PRT 

<213> Gemmata obscuriglobus 
<220> 

<221> mis cofeature 

<222> (36).. (59) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (75).. (78) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (94) . . (97) 

<223> Xaa can be any naturally occurring amino acid 

- . <220> 

<221> misc_feature 

<222> (112) . . (115) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc__feature 

<222> (131) . . <136) 

<223> Xaa can be any naturally occurring amino acid 

<400> 38 

Cys Phe Ala Ala Gly Thr Lys Leu Leu Thr Arg Arg Gly Trp Val Ala 
1 5 10 15 

Val Glu Leu Leu Gly He Gly Asp Glu Val Ala Ser Arg Thr Glu His 

20 25 30 



Asp Leii Thr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Gly Glu Leu He 
50 55 60 



Arg Thr Thr Pro Glu His Pro Phe Trp Val Xaa Xaa Xaa Xaa Trp Thr 
65 70 75 80 



Ala Ala Gly Ser Leu Ala Ala Gly Asp Arg lie Ala Thr Xaa Xaa Xaa 

85 90 95 

Xaa Leu Ser Gly Glu Trp Val Pro He Ala Glu Val Phe Asp Thr Xaa 

100 105 110 
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Xaa Xaa Xaa Pro Val Tyr Asn Leu Arg Val Ala Asp His His Thr Tyr 
115 120 125 



Phe Val Xaa Xaa Xaa Xaa Xaa Xaa Phe Ala Ala Trp Ala His Asn Ala 
130 135 140 



<210> 39 

<211> 139 

<212> PRT . 

<213> Gemmata obscuriglobus 
<220> 

<221> misc_f eature 

<222> (36) . . (58) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (74) . . (110) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (126).. (131) 

<223> Xaa can be any naturally occurring amino acid 

<400> 39 

Cys Phe Ala Ser Gly Thr Pro Met Arg Thr Pro Gly Gly Trp Cys Asn 
15 10 15 

lie Glu Asn Leu Arg Val Gly Asp Phe Val Leu Ser Arg Asp Glu Phe 

20 25 30 



Ser Pro Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 4 5 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Gly Gin Thr lie Arg 
50 55 60 



Ser Thr Asp Glu His Pro Phe Phe Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
65 70 75 80 

■ 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

85 90 95 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Val 

100 105 110 

Tyr Asn Phe Arg Val Ala Asp His His Thr Tyr Phe Val Xaa Xaa Xaa 
115 120 125 



Xaa Xaa Xaa Phe Ser Val Trp Ala His Asn lie 
130 135 



<210> 40 

<211> 132 

<212> PRT 

<213> Pirellula sp. 
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<220> 

<221> misc_f eature 

<222> (36) . . (57) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (73).. (76) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc__f eature 

<222> (92).. (107) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc__f eature 

<222> (123) . . (124) 

<223> Xaa can be any naturally occurring amino acid 

<400> 40 

Cys Leu Val Ala Gly Thr Leu Val Trp Thr Asp Arg Gly Met Arg Pro 
15 10 15 



Val Glu Ser Leu Arg Leu Gly Asp Gin Val Leu Ser Cys Asp Val Gin 

20 25 30 



Thr Gly Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 -V. • 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Ser Asp Glu He Val Ala 
50 55 60 



Ser Lys Gly His Pro Phe Trp Val Xaa Xaa Xaa Xaa Trp Thr Thr Thr 
65 70 75 80 



Glu Gin Leu Val Pro Gly Asp Ala Leu His Gly Xaa Xaa Xaa Xaa Xaa 

85 90 95 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Lys Thr Tyr Asn Leu 

100 105 " 110 



Val Val Glu Gin Thr His Ser Tyr Phe Val Xaa Xaa Ser Arg He Leu 
115 120 125 



Ser His Asp Ala 
130 



<210> 41 

<211> 144 

<212> PRT 

<213> Streptomyces avermitilis 



<220> 

<221> misc_f eature 

<222> (14).. (14) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc feature 
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<222> (37) . . (64) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 
<222> (80).. (84) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 
<222> (114) . . (117) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 
<222> (133) . . (136) 

<223> Xaa can be any naturally occurring amino acid 
<400> 41 

Ser Phe Lys Pro Thr Thr Arg Val Leu Met Lys Asp Gly Xaa Thr Lys 
15 10 15 



Pro Leu Gly Lys lie Lys Pro Gly Asp Leu Val Glu Ala Ala Asp Pro 

20 25 30 



Thr Ser Gly His Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
50 55 60 



Arg lie Gin Thr Leu His Thr Thr Ala Arg His Arg He Trp Asp Xaa 
65 70 75 80 



Xaa Xaa Xaa Xaa Trp Glu Gin Ala Gly Arg Leu He Thr Gly His Lys 

85 90 95 



Val Asn Thr Ser Gly Asn Gin His Ala Thr He Thr Ser Val Leu Ala 

100 105 110 



Gin Xaa Xaa Xaa Xaa Asp Met Tyr Asp Leu Thr Val Glu Gly Leu His 
115 120 ■ 125 



Thr Tyr Tyr Val Xaa Xaa Xaa Xaa Thr Pro Val Leu Val His Asn Gly 
130 135 140 



<210> 42 

<211> 143 

<212> PRT 

<213> Strejptomyces avermitilis 



<220> 

<221> misc_feature 

<222> (14).. (14) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_ feature 

<222> (37) . . (63) 

<223> Xaa can be any naturally occurring amino acid 
<220> 
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<221> misc_feature 
<222> (79).. (83) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (113) . . (116) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (132) .. (135) 

<223> xaa can be any naturally occurring amino acid 

<400> 42 

Cys Phe Leu Ala Gly Thr Asp He Leu Met Ala Asp Gly Xaa Thr Lys 
1 5 10 



15 



Asp He Glu Glu Val Glu Leu Gly Asp Lys Val Gin Ala Thr Asp Pro 

20 25 30 

Lys Thr Gly Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly 
DU 55 go 

Ala Glu Glu Leu Thr Ala Thr His Glu His Pro Phe Trp Ser Xaa Xaa 

70 75 80 

Xaa Xaa Xaa Trp He Thr Ala Gly Ser Leu Glu Pro Gly Met Thr Leu 

8 5 90 95 

Leu Thr Asp Asp Gly Asp Thr Val lie Val Thr Gly Asn Arg Ala Phe 

100 105 no 

Xaa Xaa Xaa Xaa Thr Thr Tyr Asn Leu Thr Val Asn Asp Leu His Thr 

120 125 

Tyr Tyr Ala Xaa Xaa Xaa Xaa Thr Pro Val Leu Val His Asn Ser 
130 135 i 4 o 

<210> 43 

<211> 146 

<212> PRT 

<213> Streptomyces avermitilis 
<220> 

<221> misc_feature 

<222> (14).. (15) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc__feature 

<222> (38).. (66) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (82).. (86) 

<223> Xaa can be any naturally occurring amino acid 
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<220> 

<221> misc_f eature 
<222> (116) . . (119) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 
<222> (135) . . (138) 

<223> Xaa can be any naturally occurring amino acid 
<400> 43 

Ser Phe Pro Ala Gly Thr Arg Val Leu Met Gly Asp Gly Xaa Xaa Thr 
1 5 io is 

Leu Pro lie Glu Gin He Thr Val Gly Asp Ser Val Leu Ala Thr Asp 

20 25 30 

Pro Glu Ala Gly Thr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
50 55 60 

Xaa Xaa Gly Pro Pro Ala Leu Thr Ala Thr Asp Arg His Pro Phe Trp 
65 70 75 80 

Val Xaa Xaa Xaa Xaa Xaa Trp Ala Asp Ala Arg Asp Leu Asn Ser Gly 

85 90 95 

Asp Thr Leu Arg Thr Pro Asp Gly Thr Gly Val Arg He Asp Lys Val 

100 105 no 

Thr His Trp Xaa Xaa Xaa Xaa Gly Ala Tyr Asn Leu Thr Val Asn Asp 

120 125 

Leu His Thr Tyr Tyr Val Xaa Xaa Xaa Xaa Val Pro Val Leu Val His 
130 135 140 



Asn Ala 
145 



<210> 44 

<211> 129 

<212> PRT 

<213> Myxococcus xanthus 



<220> 

<221> misc_f eature 

<222> (14).. (14) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (37) . . (58) 

<;223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (74) -. . (106) 

<223> Xaa can be any naturally occurring amino acid 
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<400> 44 

Cys Val Ala Pro Trp Glu Leu Val Leu Leu Gly Asp Gly Xaa Glu Val 
1 5 io 15 

Pro Ala Glu Met Leu Arg Pro Gly Met Arg Val Leu Thr Met His Glu 

20 25 30 

His Glu Arg Asp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asp Gly Arg Val Leu Val 
50 55 60 

r 

Val Thr Pro Asp His Arg Trp Arg Thr Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
65 70 75 80 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

85 90 95 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asp Val Met Arg lie Thr 

100 105 no 

Val Arg Phe Ala Met Thr Tyr lie Val Gin Gly Leu Leu Ala His Asn 
115 120 125 



Leu 



<210> 45 

<211> 131 

<212> PRT 

<213> Myxococcus xanthus 



<220> 

<221> misc_feature 
<222> (14) . . (14) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (37).. (58) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc__feature 

<222> (74).. (108) 

<223> Xaa can be any naturally occurring amino acid 

<400> 45 

Cys Val Ala Pro Trp Glu Pro Val Leu Leu Ser Asp Gly Xaa Glu Val 
1 5 10 15 

Pro Ala Glu Met Leu Arg Pro Gly Met Lys Val Leu Thr Met His Glu 

20 25 30 

His Glu Arg Asp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 
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Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asp Gly Arg Ala Val Val 
50 55 60 



Val Thr Pro Asp His Arg Trp Arg Thr Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
65 70 75 80 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

85 90 95 



Xaa Xaa Xaa .Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asp Val Met Lys 

100 105 110 



He Ser Val Arg Phe Ala Lys Thr Tyr Val Val Gin Gly Leu Leu Ala 
115 120 125 



His Asn Leu 
130 



<210> 46 

<211> 137 

<212> PRT 

<213> Verrucomicrobium spinosum 
<220> 

<221> misc_feature 

<222> t36) . . (57) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (73).. (77) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (107) . . (112) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (128) . . (129) 

<223> Xaa can be any naturally occurring amino acid 

<400> 46 

Cys Phe Pro Ser Gly Thr Met Val Gin Thr Ala Arg Gly Lys Val Ala 
1 5 10 15 

He Glu Thr Leu Lys Glu Gly Asp Val Val Leu Ala Tyr Asp Phe Leu 

20 25 30 



Ser Glu Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Asp Ser Lys He Ser Ala 

50 55 60 



Thr Arg Phe His Leu Phe Trp Val Xaa Xaa Xaa Xaa Xaa Trp Val Pro 
S3 70 75 80 
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Ala Val Asp Leu Gin Pro Gly Met Val Leu Arg Leu Glu Ser Gly Ala 

85 90 95 



Leu Thr Val Val Thr Leu Ala Lys Leu Arg Xaa Xaa Xaa Xaa Xaa Xaa 

100 105 110 



Ala Thr His Asn Phe Glu Val Ala Asp Leu His Asn Tyr Phe Val Xaa 
115 120 125 



Xaa Gin Gly .Phe Leu Val His Asn Gly 
130 135 



<210> 47 

<211> 140 

<212> PRT 

<213> Verrucomicrobium spinosum 
<220> 

<221> misc_f eature 

<222> (14).. (14) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> raisc_feature 

<222> (37) . . (60) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (76) . . (79) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (95).. (113) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> mis cofeature 

<222> (129) . . (132) 

<223> Xaa can be any naturally occurring amino acid 

<400> 47 

Cys Phe Pro Ala Gly Thr Met Val Leu Met Ala Asp Gly Xaa Ser Val 
1 5 10 .15 



Pro lie Glu Gin Val Val Glu Gly Asp lie Val Leu Ala Ala Glu Pro 

20 25 30 



Glu Thr Glu Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Gly Ser Val 
50 55 60 

* 

Leu Lys Val Thr Gly Glu His Pro He Trp Thr Xaa Xaa Xaa Xaa Trp 
65 70 75 80 



Gin His Ala Asp Asp Leu Val Glu Gly Asp Leu Leu Leu Lys Xaa Xaa 

85 90 " 95 
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Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

100 105 no 

Xaa Asp Thr Phe Asn Leu Cys Val Glu Gly val His Thr Phe Tyr Val 
115 120 125 

Xaa Xaa Xaa Xaa Asp Ala Val Leu Val His Asn Thr 
130 135 140 

<210> 48 
<211> 145 
<212> PRT 

<213> Verrucomicrobium spinosum 
<220> 

<221> misc_feature 
<222> (14).. (14) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (37).. (66) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (82).. (85) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc^feature 

<222> (101) . . (118) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc^feature 

<222> (134) . . (137) 

<223> Xaa can be any naturally occurring amino acid 

<400> 48 

Cys Phe Ala Pro Gly Thr Pro Val Leu Met Gly Asp Gly Xaa Thr Arg 
1 5 10 is 

Pro Val Glu Thr lie Arg Glu Gly Asp Trp He Met Ala Asp Asp Pro 

20 25 . 3 0 

Glu Asp Glu Arg Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 

Xaa xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
50 55 60 

Xaa Xaa Pro Asp Gly Ala Leu Lys Ala Thr Gly Gly His Pro Phe Trp 
65 70 75 80 

* 

Thr Xaa Xaa Xaa Xaa Trp He Lys Val Cys Asn Leu Gin Pro Asn Asp 

85 90 95 

He Leu Ala Asp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

100 los no 
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Xaa Xaa Xaa Xaa Xaa Xaa Ala Thr Tyr Asn Leu Ser Val Ala Asn lie 
115 120 125 



His Thr Phe Phe Val Xaa Xaa Xaa Xaa Val Pro Val Leu Val His Asn 
130 135 140 



Thr 
145 



<210> 49 

<211> 141 

<212> PRT 

<213> Gloeobacter violaceus 



<220> 

<221> misc_f eature 

<222> (36).. (64) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (80) . . (83) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (113) . . (116) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (132) . . (133) 

<223> Xaa can be any naturally occurring amino acid 

<400> 49 

Cys Phe Ala Glu Gly Thr Glu Val Gin Thr Glu Thr Gly Thr Lys Ala 
1 5 10 15 

lie Glu Lys Val Glu Pro Gly Glu Lys Val Leu Ala Arg Asn Glu Lys 

20 25 30 



Thr Gly Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
50 55 60 



Glu Arg Asp Thr Leu Thr Val Thr Gly Glu His Pro Phe Phe Leu Xaa 
65 70 75 80 



Xaa Xaa Xaa Trp Thr Ala Ala Glu Arg Leu Arg Ser Gly Glu Arg Val 

85 90 95 



Gin Ala Val Asp Gly Lys Trp Leu Arg Val Val Gly Leu Gin Pro Gin 

100 . 105 " 110 



Xaa Xaa Xaa Xaa Arg Thr Tyr Asn Leu Glu Val Glu Gly Glu His Thr 
115 120 125 
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Phe Phe Val Xaa Xaa Thr Arg Ala Trp Val His Asn Glu 
130 135 140 



<210> 50 

<211> 141 

<212> PRT 

<213> Gloeobacter violaceus 



<220> 

<221> misc_feature 

<222> (36) .:. (64) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (80).. (83) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (113) . . (116) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (132) . . (133) 

<223> Xaa can be any naturally occurring amino acid 

<400> 50 

Cys Phe Ala Glu Gly Thr Glu Val Gin Thr Glu Thr Gly Ala Lys Pro 
1 5 10 15 



lie Glu Leu Val Ala Pro Gly Glu Lys Val Leu Ala Arg Asn Glu Gin 

20 25 30 



Thr Gly Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
50 55 60 



Asp Arg Asp Val Leu Thr Val Thr Gly Glu His Pro Phe Phe Leu Xaa 
65 70 75 80 



Xaa Xaa Xaa Trp Thr Ala Ala Asp Lys Leu Gin Val Gly Glu Arg Val 

85 90 95 



Gin Thr Val Asp Gly Gin Trp Leu Arg Val Ala Gly Leu Gin Ala Gin 

100 105 110 



Xaa Xaa Xaa Xaa Arg Thr Tyr Asn Leu Glu Val Glu Arg Asp His Thr 
115 120 125 



Phe Phe Val Xaa Xaa Ser Lys Ala Trp Val His Asn Glu 
130 135 140 



<210> 51 

<211> 141 

<212> PRT 

<213> Gloeobacter violaceus 
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<220> 

<221> misc_f eature 

<222> (36).. (64) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (80).. (83) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc^feature 

<222> (113) . . (116) 

<223> xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (132) . . (133) 

<223> Xaa can be any naturally occurring amino acid 

<400> 51 

Cys Phe Ser Glu Gly Thr Glu Val Gin Thr Glu Ala Gly Ala Lys Pro 
1 5 io 15 

lie Glu Leu Val Glu Pro Gly Glu Lys Val Leu Ala Arg Asn Glu Gin 

20 25 30 

Thr Gly Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
50 55 60 

Glu Arg Asp Thr Leu Thr Val Thr Gly Glu His Pro Phe Phe Leu Xaa 
65 70 75 80 

Xaa Xaa Xaa Trp Thr Ala Ala Glu Arg Leu Lys Ser Gly Glu Arg Val 

85 90 95 

Gin Ala Ala Asp Gly Lys Trp Leu Arg Val Ala Gly Leu Glu Ala Gin 

100 105 HO 

Xaa Xaa Xaa Xaa Arg Thr Tyr Asn Leu Glu Val Glu Gly Asp His Thr 
115 120 125 

Phe Phe Val Xaa Xaa Asn Gin Ala Trp Val His Asn Glu 
130 135 140 



<210> 52 

<211> 139 

<212> PRT 

<213> Chromobacterium violaceum 



<220> 

<221> misc_feature 

<222> (36)7.(62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (78).. (81) 
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<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (97) . . (114) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc feature 

<222> (1307..(131) 

<223> Xaa can be any naturally occurring amino acid 

<400> 52 

Cys Phe Val Ala Gly Thr Gin Val Leu Thr Asp Lys Gly Leu Lys Ala 

1 5 10 15 

He Glu Thr Phe Val Gly Gly Glu Trp Val Trp Ser Arg Ser Asp Gin 

20 25 30 

Thr Gly Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Arg Gin 
50 55 6 o 

Glu Thr Phe Arg Thr Thr Ala Glu His Pro Phe Trp Val Xaa Xaa Xaa 
65 70 75 80 

Xaa Trp Leu Lys Ala Ser Leu Leu Gin Ala Gly Val He Leu Val Asp 

85 90 95 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

100 105 no 

Xaa Xaa Thr Val Phe Asn He Gin Val Ala Glu Phe Gin Thr Tyr His 
115 120 125 

Val Xaa Xaa Leu Gly Val Trp Val His Asn Ala 
130 135 

<210> 53 

<211> 141 

<212> PRT 

<213> Gloeobacter violaceus 
<220> 

<221> misc_feature 

<222> (36)... (64) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (80)..<83) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (113) . . (116) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc feature 
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<222> (132) . . (133) 

<223> Xaa can be any naturally occurring amino acid 
<400> 53 

Cys Phe Ala Glu Gly Thr Glu Val Gin Thr Glu Thr Gly Thr Lys Ala 
15 10 15 



lie Glu Lys Val Glu Pro Gly Glu Lys Val Leu Ala Arg Asn Glu Lys 

20 25 30 



Thr Gly Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
50 55 60 



Glu Arg Asp Thr Leu Thr Val Thr Gly Glu His Pro Phe Phe Leu Xaa 
65 70 75 80 



Xaa Xaa Xaa Trp Thr Ala Ala Asp Lys Leu Gin Ala Gly Asp Arg Val 

85 90 95 



Gin Ala Val Asp Gly Arg Trp Leu Arg Val Val Gly Leu Ala Ala Gin 

100 105 110 



Xaa Xaa Xaa Xaa Arg Thr Tyr Asn Leu Glu He Glu Gly Glu His Thr 
115 120 125 



Phe Phe Val Xaa Xaa Asn Gin Ala Trp Val His Asn Glu 
130 135 140 



<210> 54 

<211> 93 

<212> PRT 

<213> Gloeobacter violaceus 



<220> 

<221> misc_feature 

<222> (36).. (78) 

<223> Xaa can be any naturally occurring amino acid 

<400> 54 

Cys Phe Gly Glu Gly Thr Ala Val Gin Thr Glu Thr Arg Ala Lys Pro 
1 5 10 15 



He Glu Gin He Glu Pro Gly Glu Lys Val Leu Ala Arg Ser Glu Arg 

20 25 30 



Thr Gly Gin Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
50 55 60 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Arg 
65 70 75 80 
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Asp Thr Ser Thr Val Thr Gly Glu His Pro Phe Tyr Leu 

85 90 



<210> 55 

<211> 81 

<212> PRT 

<213> Neisseria meningitidis (group C) 
<220> 

<221> misc^feature 

<222> (16) . . (18) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc__feature 

<222> (48) . . (51) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (67) . . (73) 

<223> Xaa can be any naturally occurring amino acid 

<400> 55 

Asn Ser Gin lie Leu lie Ser Asn Arg lie His Pro Phe Tyr Ser Xaa 
15 10 15 



Xaa Xaa Trp lie Lys Ala Glu Asp Leu Lys Ala Gly Ser Arg Leu Leu 

20 25 30 



Ser Glu Ser Gly Lys Thr Gin Thr Val Arg Asn lie Val Val Lys Xaa 
35 40 45 



Xaa Xaa Xaa Lys Ala Tyr Asn Leu Thr Val Ala Asp Trp His Thr Tyr 
50 55 60 



Phe Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Gly Val Trp Val His Asn 
65 70 75 80 



Asp 



<210> 56 

<211> 91 

<212> PRT 

<213> Clostridium thermocellum 



<220> 

<22l> misc_feature " 

<222> (36).. (57) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (73).. (76) 

<223> Xaa can be any naturally occurring amino acid 

<400> 56 



Cys Phe Val Ala Gly Thr Met lie Leu Thr Ala Thr Gly Leu Val Ala 
15 10 15 

* 
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lie Glu Asn He Lys Ala Gly Asp Lys Val He Ala Thr Asn Pro Glu 

20 25 30 



Thr Phe Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Gly Glu Val He Lys Thr 
50 55 60 



Thr Phe Glu His Pro Phe Tyr Val Xaa Xaa Xaa Xaa Phe Val Glu Ala 
65 70 75 80 



Gly Lys Leu Gin He Gly Asp Arg Leu Val Asp 

85 90 



<210> 57 

<211> 45 

<212> PRT 

<213> Clostridium thermocellum 



<220> 

<221> misc_feature 
<222> (15).. (20) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 
<222> (36) . . (37) 

<223> Xaa can be any naturally occurring amino acid 
<400> 57 

Ser Lys Gly Asn Val Leu Val Val Glu Glu Lys Lys Leu Glu Xaa Xaa 
15 10 15 

Xaa Xaa Xaa Xaa Lys Val Tyr Asn Phe Lys Val Asn Asp Phe His Thr 

20 25 30 



Tyr His Val Xaa Xaa Asp Gly He Leu Val His Asn Ala 
35 40 45 



<210> 58 

<211> 24 

<212> PRT 

<213> Clostridium thermocellum 



<220> 

<221> misc_f eature 

<222> (15) . . (16) 

<223> Xaa can be any naturally occurring amino acid 

<400> 58 

* 

Val Tyr Asn Phe Lys Val Asp Asn Phe His Thr Tyr His Val Xaa Xaa 
1 5-10 15 



Asn Arg Val Leu Val His Asn Ala 

20 



<210> 59 
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<211> 39 
<212> PRT 

<213> Clostridium thermocellum 



<220> 

<221> misc_f eature 
<222> (16).. (34) 

<223> Xaa can be any naturally occurring amino acid 
<400> 59 

♦ 

Phe Val Lys Glu Met Lys Leu Gin Pro Gly Asn Arg Leu Val Asp Xaa 
15 10 15 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

20 25 30 



Xaa Xaa Lys Val Tyr Asn Phe 
35 



<210> 60 

<211> 63 

<212> PRT 

<213> Leptospira interrogans 



<220> 

<221> misc_feature 
<222> (30) ... (34) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 
<222> (50).. (55) 

<223> Xaa can be any naturally occurring amino acid 
<400> 60 

Trp Val Lys Val Glu Asp Leu Arg Leu Arg Asp Gin Val Leu Arg Ser 
15 10 15 



Asp Gly Ser Trp Gly Thr Val Thr Gly He Tyr Tyr Tyr Xaa Xaa Xaa 

20 25 30 



Xaa Xaa Lys Val Tyr Asn Leu Glu Val Glu Asp Asn His Thr Tyr He 
35 40 45 



val Xaa Xaa Xaa Xaa Xaa Xaa He Gly Tyr Val Val His Asn Tyr 
50 55 60 



<210> 61 

<211> 10 

<212> PRT 

<213> Gloeobacter violaceus 

<400> 61 

Cys Phe Ala Glu Gly Thr Glu Val Gin Thr 
1 5 10 



<210> 62 

<2ll> 15 

<212> PRT 

<213> GloeobaGter violaceus 



WO 2004/043997 



48 

<400> 62 

Trp Thr Ala Ala Glu Arg Leu Glu Pro Gly Asp Arg Val Gin Ala 
1 5 io ' ' 15 



<210> 63 

<211> 153 

<212> PRT 

<213> Rhodobacter capsulatus 
<220> 

<221> misc_feature 

<222> (31).. (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (45).. (56) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (67) . . (80) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (95).. (106) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (122) . . (129) 

<223> Xaa can be any naturally occurring amino acid 

<400> 63 

Cys Phe Thr Pro Gly Thr Leu He Asp Thr Pro Ala Gly Pro Arg Pro 
1 5 ' 10 15 ' 

Val Glu Ala Leu Arg Pro Gly Asp Arg Val Ser Thr Arg Asp Xaa Xaa 

20 25 30 

■ 

Xaa Gin Glu He Leu Trp He Gly Ser Arg Arg Met Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa . Pro Val Arg Leu Gly Ala Val Arg 
50 55 60 

i 

t 

Leu Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
65 70 75 .80 

Ala Ala Asp Leu Leu Val Ser Pro Gin His Arg Val Leu Val Xaa Xaa 

85 90 95 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Val Leu Val Gin Ala 

100 ' 105 no 

i 

Cys Asp Leu Val Asp Asp Ala Ala Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
115 120 125 
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Xaa Val Thr Tyr Leu His Leu Leu Phe Ala Arg His Gin Val lie Arg 
130 135 140 



Ala Asn Gly Val Glu Thr Glu Ser Phe 
145 150 



<210> 64 

<211> 141 

<212> PRT 

<213> Rhodobacter capsulatus 

• 

<220> 

<221> misc_f eature 

<222> (31).. (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (45).. (56) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (67).. (67) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (82).. (93) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (109) . . (116) 

<223> Xaa can be any naturally occurring amino acid 

<220> ... 

<221> misc^feature 

<222> (134) . . (134) 

<223> Xaa can be any naturally occurring amino acid 

<400> 64 

Gly Phe Tyr Gly Glu Thr Val Leu Gin Thr Ala Arg Gly Leu Arg Arg 
1 5 10 15 



Val Ser Ser lie Leu Glu Gly Glu Lys Met Arg Thr Phe Thr Xaa Xaa 

20 . 25 30 



Xaa Ala Pro Val Leu Ser He Glu Arg Phe Ala Leu Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Leu Ser Leu Pro Ala Gly Leu 
50 55 60 



Phe Gly Xaa Thr Arg Asn Arg Phe Val Ala Pro Glu Gin Cys Leu Leu 
65 70 75 80 



Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Leu Leu 

85 90 95 



Val Pro Ala Lys Val Leu Gly Leu Leu Pro Gin Val Xaa Xaa Xaa Xaa 

100 105 110 
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Xaa Xaa Xaa Xaa Ala Val Leu Tyr Arg Leu Leu Phe Glu Arg Pro Glu 
115 120 125 



Leu Val Val Thr Asp Xaa Gly Ala Val Met Leu Cys Asp 
130 135 140 



<210> 65 

<211> 134 

<212> PRT 

<213> Rhodqbacter capsulatus 
<220> 

<221> misc__feature 

<222> (31).. (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc^feature 

<222> (45).. (51) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (62).. (62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc__f eature 

<222> (77).. (88) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> mis c__f eature 

<222> (104) . . (110) 

<223> Xaa can be any naturally occurring amino acid 

<400> 65 

Gly Phe Ala Ala Gly Thr Arg Val Arg Thr Pro Ala Gly Leu Arg Arg 
1 5 10 15 

He Glu Thr Leu Lys Pro Gly Asp Leu Vai Glu Thr Gin Glu Xaa Xaa 

20 25 30 



Xaa Gin Pro Val Val Ala Vai Glu Arg Thr Arg Leu Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Pro He Arg Phe Ala Ala Gly Ala His Gly Xaa Glu Arg 
50 55 60 



Pro Val Leu Val Ala Pro Gin Gin Arg Val Leu Val Xaa Xaa Xaa Xaa 
65 70 75 80 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Val Leu Val Ala Ala Arg Thr 

85 90 95 

* 

Leu Val, Asp Gly Glu Met Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val Asp 

100 105 no 



Tyr Val Arg Leu Val Phe Asp Cys Ala His Met Val Phe Ala Glu Gly 
115 120 125 
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Leu Ala Val Glu Cys Phe 
130 



<210> 66 

<211> 135 

<212> PRT 

<213> Rhodobacter capsulatus 



<220> 

<221> misc feature 

<222> (31)7.(33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (45) . . (51) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (62).. (62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (77). -(88) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc feature 

<222> (104T. . (Ill) 

<223> Xaa can be any naturally occurring amino acid 

<400> 66 

Cys Phe Ala Pro Ser Thr Pro lie Ala Thr Pro Gly Gly Asp Cys Pro 
1 5 10 15 



Ala Ala Ser Leu Lys Ala Gly Asp Leu Val Leu Thr Ala Asp Xaa Xaa 

20 25 30 



Xaa Gin Pro lie Leu Trp Ser Gly Arg lie Ala Leu Xaa Xaa Xaa Xaa 
35 40 " 45 



Xaa Xaa Xaa Pro Val Arg Leu Cys Ala Pro Ala Phe Gly Xaa Thr Arg 
50 55 60 ' 



Asp Leu Trp Val Leu Pro Gin His Arg Val Ala Leu Xaa Xaa Xaa Xaa 
65 70 75 80 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Val Leu Val Pro Ala His His 

85 90 95 



Leu Val Asp Gly lie Ser Ala Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu 

100 105 110 

Ser Trp His Gly Leu Leu Leu Gin Gly His His Leu Leu lie Ala Asp 
115 120 125 



Gly Cys Arg Val Glu Ser Leu 
130 135 
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<210> 67 

<211> 135 

<212> PRT 

<213> Rhodobacter capsulatus 



<220> 

<221> misc_feature 

<222> (31) . . (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc^feature 

<222> (45).. (56) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature - 

<222> (67) . . (67) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (82) . . (86) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (102).. (Ill) 

<223> Xaa can be any naturally occurring amino acid 

<400> 67 

Cys Phe Thr Ala Gly Thr Leu lie Glu Thr Pro Arg Gly Pro Val Pro 
15 10 15 



Val Glu Ser Leu Arg Ala Gly Asp Leu Val Val Thr Arg Asp Xaa Xaa 

20 25 30 



Xaa Val Pro Val Leu Trp Ser Gly Gly Arg Ser Leu Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Val Ala He Arg Glu Asn Ala 
50 55 60 



Leu Gly Xaa His Gly Ala Leu Leu Leu Ser Pro Gin His Ala Val Leu 
65 70 75 80 



Ala Xaa Xaa Xaa Xaa Xaa Glu Arg Leu Val Arg Ala Arg His Leu Ala 

85 90 95 



Gly Leu Asn Asp Pro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val 

100 105 110 



Ser Tyr His His He Leu Leu Glu Arg His Gly He Val Thr Ala Asn 
115 120 125 



Gly Leu Ala Cys Glu Ser Leu 
130 135 



<210> 68 
<211> 146 
<212> PRT 
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<213> Rhodobacter capsulatus 



<220> 

<221> misc_feature 

<222> (31) . . (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (45) . . (59) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (70) . . (73) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (88) . . (99) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (115) . . (122) 

<223> Xaa can be any naturally occurring amino acid 

<400> 68 

Ala leu Ala Arg Gly Ser Val Leu Met Thr Glu Asp Gly Pro Val Ala 
15 10 15 



lie Glu Asp Leu Gin Pro Gly Gin Gly Val Leu Thr Ala Glu Xaa Xaa 

20 25 30 



Xaa Glu Arg Val Cys Trp lie Gly Ser Met Val lie Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Thr Arg lie Thr 
50 55 60 



Ala Glu Ala Phe Gly Xaa Xaa Xaa Xaa Ala Leu Asp Leu Val Leu Gly 
65 70 75 80 



Pro Arg Ala Arg Leu Cys Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

85 90 95 



Xaa Xaa Xaa Ala Ala Asp Val Pro Ala Arg Ala Phe Leu Asp Gly lie 

100 105 110 



Ser Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val Thr Val Tyr His Val 
115 120 125 



Val Leu Glu Gin His Gly Ser Leu Arg Val Ala Gly Leu Glu Val Glu 
130 135 140 



Ala Phe 
145 



<210> 69 
<211> 136 
<212> PRT 
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<213> Rhodobacter capsulatus 
<220> 

<221> misc__f eature 

<222> (31).. (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (45) . . (53) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (64) . . (67) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc^f eature 

<222> (82) . . (93) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (109) . . (112) 

<223> Xaa can be any naturally occurring amino acid 

<400> 69 

Cys Leu Gly Thr Gly Thr Met He Ala Thr Ala Glu Gly Pro Ala Pro 
15 10 15 

He Asp Trp Leu Arg Pro Gly Asp Arg Val Leu Thr Arg Asp Xaa Xaa 

20 25 30 

Xaa Gin Pro Leu Leu Trp Val Gly Gin His Thr Met Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa Xaa Xaa Pro Leu Leu Leu Ser Ala Ala Cys Phe Gly Xaa 
50 55 60 

Xaa Xaa Xaa Glu Arg Asp Val Leu Leu Ser Pro Gly Thr Gly Val Leu 
65 70 75 " 80 

Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Met Phe 

85 90 95 

Ala Lys Ala Arg His Ala Leu Pro Lys Ala Glu Ala Xaa Xaa Xaa Xaa 

100 105 HO 

Gin Lys Leu Tyr Ser Met Leu Leu Ala Thr Pro Glu Val Val Leu Ala 
115 120 125 

Glu Gly Met Trp Val Gly Ser Val 
130 135 



<210> 70 

<211> 132 

<212> PRT 

<213> Rhodobacter -capsulatus 



<220> 
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<221> misc__f eature 
<222> (31).. (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 
<222> (45).. (51) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 
<222> (62).. (62) 

<223> Xaa dan be -any nat« rally occurring amino acid 
<220> 

<221> misc_f eature 
<222> (77).. (86) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 
<222> (102) . . (108) 

<223> Xaa can be any naturally occurring amino acid 
<400> 70 

Cys Phe Ala Ala Gly Thr Leu lie Ala Thr Arg Arg Gly Pro Lys Pro 
15 10 15 




Val Glu Asp Leu Gly Pro Glu Asp Arg Leu Gin Thr Ser Asp Xaa Xaa 

20 25 30 



Xaa Arg Pro Val Gin Trp Val Gly Arg Trp Arg Val Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Pro Val Arg Phe Ala Pro Gly Val Leu Gly Xaa Asp Arg 
50 55 60 



Ala Leu Phe Leu Ser Gly Gin His Arg Val Leu lie Xaa Xaa Xaa Xaa 
65 70 75 80 



Xaa 'Xaa Xaa Xaa Xaa Xaa Glu Val Leu Val Ala Ala Lys Ala Leu Val 

85 90 95 



Gly Leu Pro Gly lie Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val Asp Trp Val 

100 105 110 

His Val Met Met Pro Thr His Glu Val He Phe Ala Glu Asn Ala Arg 
115 120 125 

Ala Glu Thr Met 
130 



<210> 71 

<211> 135 

<212> PRT 

<213> Rhodobacter 



capsulatus 



<220> 

<221> mi sc_f eature 

<222> (14) . . (14) 

<223> Xaa can be any naturally occurring amino acid 
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<220> 

<221> misc_ f eature 

<222> (32).. (34) 

<223> Xaa can be any narturj^lly occurring amino acid 
<220> 

<221> misc__feature 

<222> (46).. (52) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (63). ; . (63) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc — feature 

<222> (78).. (89) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (105) . . (Ill) 

<223> Xaa can be any naturally occurring amino acid 

<400> 71 

Ala Phe Thr Thr Gly Thr Leu lie Thr Met Ala Gly Gly Xaa Gin Arg 
1 5 10 15 



Pro lie Glu Thr Leu Ala Pro Gly Asp Arg Val Leu Thr Arg Asp Xaa 

20 25 30 



Xaa Xaa Gin Pro Val Arg Leu Val Ala Arg Ala Thr Leu Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Pro Val Val lie Ser Ala Gly Thr Leu Gly Xaa Glu 
50 55 60 



Ser Asp Leu Val Val Ala Pro His His Arg Val Phe Leu Xaa Xaa Xaa 
65 70 75 80 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu lie Leu Val Gin Ala Lys 

85 90 95 



His Leu Val Asp Gly Glu His Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val 

100 105 110 



Asp Tyr Phe Ala Leu Val Phe Asp Arg His Glu lie Val Tyr Ala Glu 
115 120 125 



Gly Val Pro Val Glu Ser Leu 
130 135 



<210> 72 

<211> 139 

<212> PRT ' 

<213> Rhodotoacter capsulatus 



<220> 

<221> misc_feature 

<222> (31).. (33) 

<223> Xaa can be any naturally occurring amino acid 
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<220> 

<221> misc_feature 

<222> (45).. (56) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc__f eature 

<222> (67) . . (70) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_,f eature 

<222> (85).. (93) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> mi sc_f eature 

<222> (109) . . (115) 

<223> Xaa can be any naturally occurring amino acid 

<400> 72 

Cys Phe Thr Ala Thr Ser Leu lie Ala Thr Gly Gin Gly Gly Val Pro 

15 10 15 



Val Ser Glu Leu Val Pro Gly Ala Arg Val lie Thr Arg Asp Xaa Xaa 

20 25 30 



Xaa Gin Glu Leu Leu Trp Val Gly Arg Arg Arg Phe Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Val Arg lie Ala Ala Gly Ala 
50 55 60 



Leu Gly Xaa Xaa Xaa Xaa Glu Arg Asp Met Leu Val Ser Pro Asn His 
65 70 75 80 



Arg Phe Leu Thr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Arg Leu 

85 90 95 



Thr Met Ala Arg Asp Leu Val Gly Leu Asp Gly lie Xaa Xaa Xaa Xaa 

100 105 110 



Xaa Xaa Xaa Val Asp Tyr Trp Gin Leu Leu Phe Ala His His Glu Leu 
115 120 125 



Val Leu Ala Asp Gly Ala Trp Ser Glu Ser Phe 
130 135 



<210> 73 

<211> 142 

<212> PRT 

<213> Rhodobacter capsulatus ' 



<220> 

<221> misc^f eature 

<222> (31).. (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> niisc_feature 

<222> (45).. <56) 
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<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (67) . . (70) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (85) . . (96) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (112) . . (118) 

<223> Xaa can be any naturally occurring amino acid 

<400> 73 

Cys Leu Thr Pro Gly Thr Leu lie Glu Thr Lys Arg Gly Gin Val Pro 
1 5 10 " 15 

Val Glu Lys Leu Arg Pro Gly Asp Arg Val Leu Thr Arg Asp Xaa Xaa 

20 25 30 

Xaa Gin Pro lie Arg Trp He Gly Arg Arg Arg Leu Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Val Arg He Ala Ala Gly Ala 
50 55 60 

Leu Gly Xaa Xaa Xaa Xaa Glu Thr Asp Met Leu Val Ser Pro Gin His 
65 70 75 80 

Arg Met Leu He Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

85 90 95 

■ 

Glu Val Leu Ala Ala Ala Leu His Met Leu Gly Gin Pro Gly He Xaa 

100 105 HO 

Xaa Xaa Xaa Xaa Xaa Xaa Val Thr Tyr Leu His Leu Met Leu Asp Ala 
115 120 125 

His Glu He He Arg Ala Asn Gly Ala Trp Thr Glu Ser Phe 
130 135 140 

<210> 74 

<211> 136 

<212> PRT 

<213> Rhodobacter capsulatus 
<220> 

<221> misc feature 

<222> 131)7. -(33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (45) . . (56) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc feature 




WO 2004/043997 

59 

<222> (67) . . (67) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 
<222> (82).. (83) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc feature 
<222> (99)7.(112) 

<223> Xaa can be any naturally occurring amino acid 
<400> 74 

Cys Leu Val Ala Gly Ser Arg Val Ser Thr Pro Arg Gly Pro Val Pro 
1 5 10 " " 15 

Val Glu Asp Leu Arg Pro Glu Asp Leu Val Thr Val Arg Asp Xaa Xaa 

20 25 30 

Xaa Leu Pro Val Leu Trp He Gly Arg Arg Arg Val Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Val Glu He Gly Ala Gly Arg 
50 55 60 

Leu Gly Xaa Ala Ala Pro Val Arg Leu Ser Ala Leu His Gly He Ala 
6 5 70 75 - 80 

Val Xaa Xaa Gly Phe Leu Ala Arg Ala Gly His Leu Ala Ala Thr Gly 

85 90 95 

Trp Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

100 105 no 

Val Leu Tyr Leu His Leu Leu Leu Pro Arg His Ala Leu Leu Ser Val 
115 120 125 

Glu Gly Leu Trp Val Glu Ser Phe 
130 135 

<210> 75 

<211> 132 

<212> PRT 

<213> Rhodobacter capsulatus 
<220> 

<22l> misc__f eature 

<222> (31)7. .(46) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc feature 

<222> (57)7.(60) 

<223> Xaa can -be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (75).. (86) 

<223> Xaa can be any naturally occurring amino acid 
<220> 
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<221> misc_f eature 
<222> (102) . . (108) 

<223> Xaa can be any naturally occurring amino acid 
<400> 75 

Gly Phe Ala Met Gly Ser Arg Val Ala Thr Met Asp Gly Leu Leu Pro 
15 10 15 



Val Glu Phe Leu Asn Leu Gly Asp Arg He Val Thr Arg Ser Xaa Xaa 

20 25 30 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Val 
35 40 45 



Gly He Ala Pro Gly Ala Leu Gly Xaa Xaa Xaa Xaa Gly Gin Ala Met 
50 55 60 



Val Leu Gly Ser Gly Thr Gin Val Leu Leu Xaa Xaa Xaa Xaa Xaa Xaa 
65 70 75 80 



Xaa Xaa Xaa Xaa Xaa Xaa Gin Ala Leu Val Ala Val Glu Arg Leu He 

85 90 95 



Asp Gly Gin Phe He Xaa Xaa Xaa Xaa Xaa Xaa Xaa He Arg He Phe 

100 105 110 



Ala Leu His Phe Glu Ala Pro Glu Val He Tyr Ala Asp Gly Val Glu 
115 120 125 



He Gly Cys Lys 
130 



<210> 76 

<211> 142 

<212> PRT 

<213> Rhodobacter sphaeroides 



<220> 

<221> misc_feature 

<222> (31) . . (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (45).. (56) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (67) . . (70) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (85).. (96) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc feature 

<222> (1127. . (118) 

<223> Xaa can be any naturally occurring amino acid 
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<400> 76 

Cys Phe Thr Pro Gly Thr Leu lie Ala Thr Val Arg Gly Glu Val Ala 
15 10 15 



Val Glu Ala Leu Ala Ala Gly Asp Arg lie Val Thr Arg Asp Xaa Xaa 

20 25 30 



Xaa Gin Pro Leu Arg Trp lie Ser Arg Arg Arg Leu Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Val Leu lie Glu Lys Gly Ser 
50 55 60 



Leu Gly Xaa Xaa Xaa Xaa Asp Arg Asp Met Met Val Ser Pro Asn His 
65 70 75 80 



Arg He Leu Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

85 90 95 



Glu Val Leu Val Ala Ala Lys His Leu Val Gly Pro Arg Gly He Xaa 

100 105 " 110 



Xaa Xaa Xaa Xaa Xaa Xaa Thr Thr Tyr Leu His Leu Met Phe Asp Arg 
115 120 " 125 



His Glu Val Val Leu Ala Asn Gly Ala Trp Thr Glu Ser Phe 
130 135 140 



<210> 77 

<211> 133 

<212> PRT 

<213> Rhodobacter sphaeroides 
<220> 

<221> misc_f eature 

<222> (31) . . (47) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (58) . . (61) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (76).. (87) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc__f eature 

<222> (103).. (109) 

<223> Xaa can be any naturally occurring amino acid 

<400> 77 

Ser Leu Thr Ala Gly Thr Pro Val Leu Thr Leu Ala Gly He Arg Pro 
15 10 15 



Ala Glu Gly He Arg Pro Gly Asp Arg Leu Val Ala Arg Ser Xaa Xaa 

20 25 30 
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Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Met 
35 40 45 



Val Ala He Gly Ala Ser Thr Leu Ala Xaa Xaa Xaa Xaa Asp Glu Thr 
50 55 60 



Leu Leu Val Pro Ala Asp Gin Pro Leu Leu Leu Xaa Xaa Xaa Xaa Xaa 
65 70 75 80 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Val Val Leu Pro Ala Arg Arg Leu 

85 90 95 



Val Asp Gly Gin Leu Thr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val Asp Leu 

100 105 110 



Val Thr Leu Thr Phe Ala Ala Pro Ala Ala He Tyr Ala Ser Glu Leu 
115 120 125 



His Pro Val Thr Arg 
130 



<210> 78 

<211> 132 

<212> PRT 

<213> Brucella suis 



<220> 

<221> misc_f eature 

<222> (31).. (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (45).. (56) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (67) . . (70) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (85).. (85) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (101) . . (108) 

<223> Xaa can be any naturally occurring amino acid 

<400> 78 

Cys Leu Leu Lys Gly Thr Leu Val Thr Thr Pro Asn Gly Pro Val Ala 
1 5 10 ' 15 

Val Glu Lys Leu Cys Val Gly Asp Leu Val Thr Thr Val Ser Xaa Xaa 

20 25 30 



Xaa Leu Pro He Lys Trp He Gly Trp Gin Asn Tyr Xaa Xaa Xaa Xaa 
35 40 45 
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Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro lie Arg Val Arg Arg His Ala 
50 55 60 



Leu Asp Xaa Xaa Xaa Xaa His Arg Asp Leu Tyr Leu Ser Pro Asn His 
65 70 75 80 

» 

Ala Leu Phe He Xaa Gly Val Leu He Arg Val Lys Asp Leu Val Asn 

85 90 95 

Gly Arg Ser He Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Asp Tyr Tyr 

100 105 no 

Asn He Val Leu Asp Arg His Ala Val Val Leu Ala Glu Gly Ala Ala 
115 120 125 

Val Glu Thr Phe 
130 



<210> 79 

<211> 132 

<212> PRT 

<213> Brucella melitensis 
<220> 

<221> misc_feature 

<222> (31).. (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (45).. (56) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (67) . . (70) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (85).. (85) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (101) . . (108) 

<223> Xaa can be any naturally occurring amino acid 

<400> 79 

Cys Leu Leu Lys Gly Thr Leu Val Thr Thr Pro Asn Gly Pro Val Ala 
1 5 io 15 

Val Glu Lys Leu Cys Val Gly Asp Leu Val Thr Thr Val Ser Xaa Xaa 

20 25 30 

Xaa Leu Pro He Lys Trp He Gly Trp Gin Asn Tyr Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro He Arg Val Arg Arg His Ala 
50 ■■ 55 60 
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Leu Asp Xaa Xaa Xaa Xaa His Arg Asp Leu Tyr Leu Ser Pro Asn His 
65 70 75 80 



Ala Leu Phe lie Xaa Gly Val Leu lie Arg Val Lys Asp Leu Val Asn 

85 90 "* 95 



Gly Arg Ser He Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Asp Tyr Tyr 

100 105 110 



Asn He Val Leu Asp Arg His Ala Val Val Leu Ala Glu Gly Ala Ala 
115 120 125 



Val Glu Thr Phe 
130 



<210> 80 

<211> 138 

<212> PRT 

<213> Unknown 

<220> 

<223> BIL1 domain sequence from an unknown origin 
<220> 

<221> misc_feature 

<222> (31)7.(38) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc__f eature 

<222> (50) . . (62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (73) . . (76) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (91).. (91) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (107) . . (114) 

<223> Xaa .can be any naturally occurring amino acid 

<400> 80 

Cys Phe Leu Pro Gly Thr Met He Lys Thr Pro Ser Gly Glu Arg Pro 
15 10 15 



Val Glu Asp He Gin He Asn Asp Glu Val He Thr Phe Asp Xaa Xaa 

20 25 30 

Xaa Xaa Xaa Xaa Xaa Xaa Ser Lys He Lys Trp Val Gly Ser Lys Thr 
35 40 45 



He Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Val 
50 55 60 
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Arg lie Leu Lys Asn Ala lie Ser Xaa Xaa Xaa Xaa His Lys Asp Leu 
65 70 75 " 80 

Leu Val Thr Pro Glu His Cys Leu Phe Phe Xaa Gly Lys Phe lie Pro 

85 90 95 



Val Arg Met Leu Val Asn His Gin Thr lie Xaa Xaa Xaa Xaa Xaa Xaa 

100 105 110 

Xaa Xaa Tyr .Thr Tyr Tyr His He Glu Thr Glu Asn His Ser Val He 
115 120 125 



Tyr Ser Asp Gly Met Leu Thr Glu Ser Tyr 
130 135 



<210> 81 

<211> 138 

<212> PRT 

<213> Unknown 

<220> 

<223> BIL2 domain sequence from an unknown origin 
<220> 

<221> misc_f eature 

<222> (31).. (38) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> mi sc_f eature 

<222> (46) . . (46) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (50) . . (62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (73) . . (76) 

<223> Xaa can be any naturally occurring amino acid 
<220> ' 

<221> mi sc_f eature 

<222> (91).. (91) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_ feature 

<222> (107) . . (114) 

<223> Xaa can be any naturally occurring amino acid 

<400> 81 

Cys Phe Leu Ser Gly Thr Gin He Lys Thr Lys Leu Gly Val Lys Asn 

1 5 '10 15 

He Glu Ala Leu Gin Val Gly Asp Phe Val Thr Thr Tyr Asp Xaa Xaa 

20 25 " 30 




Xaa Xaa Xaa Xaa Xaa Xaa Arg Glu Val Thr Trp Val Gly Xaa Lys Tyr 
35 40 45 
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Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Val 
50 55 60 



Arg lie Val Lys Asp Ala lie Ala Xaa Xaa Xaa Xaa Tyr Lys Asp Leu 
65 70 75 80 



Leu Val Thr Ala Glu His Cys Leu Phe Phe Xaa Asp Lys Phe Jle Pro 

85 90 95 



Ala Arg Met Leu Val Asn Gly Ser Thr lie Xaa Xaa Xaa Xaa Xaa Xaa 

100 105 110 



Xaa Xaa Tyr Glu Tyr Tyr His Leu Glu Thr Gin Asp His Ala Val lie 
115 120 125 



lie Ala Asp Gly Val Arg Thr Glu Ser Tyr 
130 135 



<210> 82 

<211> 138 

<212> PRT 

<213> Methylobacterium extorquens 
<220> 

<221> misc_feature 

<222> (31).. (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (45) . . (56) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (67) . . (70) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (85) . . (92) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (108) . . (114) 

<223> Xaa can be any naturally occurring amino acid 

<400> 82 

Cys Phe Thr Thr. Gly Thr Leu Tie Arg Thr Ala Arg Gly Ser Val Ala 
1 5 10 15 

Val Glu Asp Leu lie Val Gly Asp Leu Ala Val Thr Ala Ser Xaa Xaa 

20 25 30 



Xaa Arg Pro lie Thr Trp He Gly Asn Arg Ala Leu Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro He Arg He Arg Ala Gly Ala 
50 55 60 
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Phe Gly Xaa Xaa Xaa Xaa Ala Arg Asp Leu Arg Leu Ser His Gly His 
65 70 75 80 

Pro Val Leu Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Val Leu Val 

85 90 - 95 

Pro Val Met Cys Leu He Asn Gly Thr Ser Val Xaa Xaa Xaa Xaa Xaa 

100 105 no 

Xaa Xaa Val .Thr Tyr Trp His He Glu Leu Asp Ala His Asp He Leu 
115 120 125 



Leu Ala Glu Gly Leu Ala Ala Glu Ser Tyr 
130 135 



<210> 83 

<211> 142 

<212> PRT 

<213> Silicibacter pomeroyi 
<220> 

<221> misc_feature 

<222> (31).. (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc^feature 

<222> (45).. (56) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (67) . . (70) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (85).. (96) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (112).. (118) 

<223> Xaa can be any naturally occurring amino acid 

<400> 83 

Cys Phe Thr Pro Gly Thr Lys He Ala Thr Pro Lys Gly Glu Arg Leu 
1 5 io is 

Val Glu Asp Leu Glu Val Gly Asp Arg Val He Thr Arg Asp Xaa Xaa 

20 25 30 

Xaa Gin Glu He Arg Trp Val Gly Ser Arg Thr Leu Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Val Leu He Arg Gin Gly Ala 
50 55 60 

* ♦ 

* * 

Leu Gly Xaa Xaa Xaa Xaa Glu Arg Asp Met He Val Ser Pro Asn His 
65 70 75 80 
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Arg lie Leu Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

85 90 95 



Glu Val Leu Val Ala Ala Lys His Leu He Gly Leu Glu Gly Val Xaa 

100 105 110 



Xaa Xaa Xaa Xaa Xaa Xaa Val Thr Tyr He His Phe Met Phe Asp Gin 
115 120 125 



His Glu Val .Val Leu Ser Asp Gly Ala Trp Thr Glu Ser Phe 
130 135 140 



<210> 84 

<211> 134 

<212> PRT 

<213> Silicibacter pomeroyi 



<220> 

<221> misc_f eature 

<222> (31) . . (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (45).. (51) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_x~eature 

<222> (62).. (62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc__feature 

<222> (77) . . (88) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<22l> misc_feature 

<222> (104) . . (110) 

<223> Xaa can be any naturally occurring amino acid 

<400> .84 

Cys Phe Cys Arg Gly Thr Leu He Ala Thr Ala Gly Gly Glu He Pro 

1 5 10 15 



Val Glu Lys Leu Arg Pro Gly Asp Arg Val He Thr Arg Asp Xaa Xaa 

20 25 30 



Xaa Gin Arg He Arg Trp He Gly Gly Thr Ser Arg Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Pro lie Arg He Arg Thr Gly Val Leu Lys Xaa Thr Arg 
50 55 ' 60 

Asp Leu Leu Val Ser Pro Asn His Arg He Leu Met Xaa Xaa Xaa Xaa 
•65 70 75 80 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Val Leu Val Ala Ala Lys Phe 

85 90 95 
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Leu Val Asp Gly Arg Ala lie Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val Asp 

100 105 110 



Tyr Tyr His Met Leu Phe Asp Gin His Glu Leu Val Leu Ser Glu Gin 
115 120 125 



Ala Trp Ser Glu Ser Phe 
130 



<210> 85 
<211> 134 
<212> PRT 



<213> Silicibacter pomeroyi 



<220> 

<221> misc_f eature 

<222> <31) . . (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (45).. (51) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> mi sc_f eature 

<222> (62) . . (62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (77).. (88) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (104) . . (110) 

<223> Xaa can be any naturally occurring amino acid 

<400> 85 

Cys Phe Ala Ala Gly Thr Arg lie Glu Thr Asp Arg Gly Gly Arg Ala 
1 5 10 15 



lie Glu Asp lie Ala Val Gly Asp Leu Val Leu Thr Arg Asp Xaa Xaa 

20 25 30 



Xaa Gin Pro Val Arg Trp Thr Gly Arg Arg Ser Val Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Pro lie Arg lie Ala Ser Gly Lys Leu Gly Xaa Leu Arg 
50 55 60 



Asp Leu Leu Val Ser Pro Gin His Arg Leu Leu Leu Xaa Xaa Xaa Xaa 
65 70 ' * 75 '80 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Val Leu Ala Ala Ala Val His 

85 90 95 



Leu Arg Asp Asp Arg His He Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val Thr 

100 105 110 
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Tyr Val His Leu Met Phe Asp Arg His Glu He He Tyr Ala Glu Gly 
115 120 125 



Val Ala Ser Glu Ser Phe 
130 



<210> 86 

<211> 143 

<212> PRT 

<213> Silicibacter pomeroyi 

* 

<220> 

<221> misc_feature 

<222> (31).. (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (45).. (56) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (67) . . (70) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (85).. (96) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (112) . . (119) 

<223> Xaa can be any naturally occurring amino acid 

<400> 86 

Cys Phe Thr Pro Gly Thr Arg He Ala Thr Pro Thr Gly Pro Arg Leu 
15 10 15 

He Glu Glu Leu Arg Glu Gly Asp Lys Val Gin Thr Arg Asp Xaa Xaa 

20 25 30 



Xaa Gin Glu He Gin Trp He Gly Gin Arg Arg Met Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro He Arg Met Arg Val Gly Ala 
50 55 .60 

■ 

Leu Gly Xaa Xaa Xaa Xaa Asp Ala Glu Leu Leu Val Ser Pro Glu His 
65 70 75 80 

~* 

Arg Met Leu Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

85 90 95 

Glu Val Leu Val Pro Ala Arg Asp Leu Val Asn Asp Ser Thr He Xaa 

100 105 HO 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val Thr Tyr Val His Leu Leu Leu Pro 
115 120 125 
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Ser His Gin He Leu Trp Ala Asn Gly He Glu Thr Glu Ser Phe 
130 135 140 



<210> 87 

<211> 134 

<212> PRT 

<213> Silicibacter pomeroyi 
<220> 

<221> misc — feature 

<222> (31).. (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (45).. (51) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (62) . . (62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (77).. (88) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> rnisc__feature 

<222> (104) . . (110) 

<223> Xaa can be any naturally occurring amino acid 

<400> 87 

Cys Phe Ala Ala Gly Thr Phe He Glu He Glu Ser Gly Pro He Pro 
15 10 15 

Val Glu Thr Leu Arg Pro Gly Asp Leu Val Gin Thr Leu Asp Xaa Xaa 

20 25 30 

i 

Xaa Gin Pro Leu Leu Gin Leu Ala Lys Thr Thr Val Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa Pro Val Leu Phe Arg Ala "Gly Val Leu Gly Xaa Phe Arg 
50 55 60 

Asp Leu Tyr Val Ser Gin Gin His Arg Met Leu He Xaa Xaa Xaa Xaa 
65 70 75 .30 

"Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Val Phe Val Pro Ala Arg Met 

85 90 95 

Leu Val Asn Gly Ser Thr lie Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Thr 

100 • 105 no 

Tyr Tyr His Leu Leu Phe Ala Arg His Glu He Val Phe Ser Glu Gly 
115 120 125 



He Pro Thr Glu Ser Tyr 
130 



WO 2004/043997 




72 



<210> 88 

<211> 134 

<212> PRT 

<213> Silicibacter pomeroyi 
<220> 

<221> misc_feature 

<222> (31) . . (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc^feature 

<222> (45) . . (51) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (62).. (62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

■ 

<221> misc_f eature 

<222> (77).. (88) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (104) . . (110) 

<223> Xaa can be any naturally occur^njj,,. agu^Q -acid 

<400> 88 

Cys Phe Val Ala Gly Thr Leu He Asp Thr Pro Tyr Gly Glu Arg Gin 
1 5 io 15 

Val Glu Arg Leu Thr Pro Gly Asp Gin Val Phe Thr Arg Asp Xaa Xaa 

20 25 30 

Xaa Gin Glu Val Arg Trp Val Gly Glu Arg Thr Val Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa Pro He Leu He Arg Ala Gly Thr Tyr Gly Xaa Gin Arg 
50 55 60 

■ 

Asp Leu Met Val Ser Pro Gin His Arg He Leu He Xaa Xaa Xaa Xaa 
65 70 75 80 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Val Leu Val Ala Ala Lys Asp 

85 90 95 

Leu Val Asp Gly Arg Arg Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa He Thr 

100 105 no 

Tyr Val His Val Met Phe Asp Ser' His Gin Val He Tyr Ser Glu Gly 
115 120 125 

Leu Ala Ser Glu Ser Phe 
130 



<210> 89 

<211> 135 

<212> PRT 

<213> Silicibacter pomeroyi 
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<220> 

<221> misc_feature 

<222> (31).. (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (45).. (51) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (62).. (62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> miscjFeature 

<222> (77).. (88) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (104) . . (Ill) 

<223> Xaa can be any naturally occurring amino acid 

<400> 89 

Ser Leu His Pro Glu Thr Pro He Ala Thr Pro Asp Gly Tyr Arg Pro 
1 5 10 15 



Leu Ser Lys He Arg Arg Gly Asp Thr Val He Val Ala Ser Xaa Xaa 

20 25 30 



Xaa Val Pro Val Leu His Arg Val Ser Arg Thr Met Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Pro Leu Thr He Arg Arg Pro Tyr Phe Gly Xaa Arg Gin 
50 55 60 



Asp He Gin Ala Ala Pro Ser Gin Arg Leu Leu Leu Xaa Xaa Xaa Xaa 
65 70 75 80 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ser Val Leu Val Pro Ala Arg His 

85 90 95 



Leu Thr Gly Gly His Ser Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ala 

100 105 110 



Thr Tyr Ala Gin Leu Leu Leu Pro Thr Asn Glu Ala Met He Thr Ala 
115 120 125 



Gly Ala Leu Ala Glu Ser Leu 
130 135 



<210> 90 

<211> 143 

<212> PRT 

<213> Silicibacter pomeroyi 



<220> 

<221> misc feature 
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<222> (31) . . (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (45) . . (56) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> mis cofeature 

<222> (67) . . (70) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (85) . . (96) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (112) . . (119) 

<223> Xaa can be any naturally occurring amino acid 

<400> 90 

Cys Phe Val Ala Gly Ser Leu lie Asp Thr Val Glu Gly Pro Arg Pro 
1 5 10 15 

Val Glu Thr Leu Ala Val Gly Asp Leu Val Pro Val Glu Asp Xaa Xaa 

20 25 30 

Xaa Gin Pro lie Leu Trp lie Gly Lys Arg Thr Leu Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Val Arg He Arg Arg Asp Ala 
50 -55 60 

Leu Gly Xaa Xaa Xaa Xaa His Arg Thr Leu Trp Val Ser Pro Gin His 
65 70 75 80 

Arg He Val Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

85 90 95 

Gin Val Phe Ala Ala Ala He His Leu Thr Asn Asp Asp Thr He Xaa 

100 105 no 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val Thr Tyr Tyr His Leu Ala Phe Glu 
115 120 125 

Arg His Leu Leu Leu Arg Ala His Gly Leu Leu Ser Glu Ser He 
130 135 140 

<210> 91 

<211> 135 

<212> PRT 

<213> Silicibacter pomeroyi 
<220> 

<221> .mis cofeature 

<222> (31).. (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 
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<221> misc^feature 

<222> (45).. (51) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc^feature 

<222> (62).. (63) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (78).. (89) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc — feature 

<222> (105) . . (Ill) 

<223> Xaa can be any naturally occurring amino acid 

<400> 91 

Cys Phe Thr Pro Gly Thr Leu lie Ala Thr Ala His Gly Pro Arg Ala 
1 5 .10 15 



lie Glu Thr Leu Arg Pro Gly Asp Leu lie Val Thr Arg Asp Xaa Xaa 

20 25 30 



Xaa Gin Pro Leu Arg Trp Val Gly Ser Arg Thr Val Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Pro He Arg Leu Asp Pro Thr Leu Leu Gin Xaa Xaa Ser 
50 55 60 



Ala Pro Leu Leu Val Ser Pro Gin His Arg Met Leu Trp Xaa Xaa Xaa 
65 70 75 80 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Val Leu Val Ala Ala Thr 

85 90 . 95 



His Leu Leu Gly Ser Pro Ala Ala Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val 

100 105 HO 

Thr Tyr Met His Leu Met Leu Asp Arg His Glu Val He Tyr Ala Asn 
115 120 125 



Asp Ala Ala Thr Glu Ser Phe 
130 135 



<210> 92 

<211> 134 

<212> PRT 

<213> Silicibacter pomeroyi 

* 

<220> 

<221> misc_feature 

<222> t31) . . (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> .(45) . . (51) 

<223> Xaa can be any naturally occurring amino acid 
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<220> 

<221> misc_feature 
<222> (62) . . (62) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 
<222> (77) . . (88) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 
<222> (104).. . (110) 

<223> Xaa can be any naturally occurring amino acid 
<400> 92 

Cys Phe Thr Pro Gly Thr He He Asp Thr Glu Asp Gly Pro Arg Leu 
15 10 15 



He Glu Glu Leu Gin Pro Gly Asp Leu He Arg Thr Leu Asp Xaa Xaa 

20 25 30 



Xaa Gin Pro Leu Arg Trp He Gly Arg Thr Thr Val Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Pro Val Leu He Arg Ala Gly Ala Leu Asp Xaa Arg Arg 
50 55 60 



Asp Leu He Val Ser Pro Gin His Arg Met Leu He Xaa Xaa Xaa Xaa 
65 70 75 80 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gin Ala Leu Val Ala Ala Lys His 

85 90 95 



Leu Val Asn Ala Arg Asp He Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val Thr 

100 105 110 



Tyr He His Leu Leu Phe Asp Arg His Glu He He Trp Ala Glu Gly 
115 120 125 



Cys Pro Thr Glu Ser Phe 
' 130 



<210> . 93 
<211> 143 
<212> PRT 

<213> Silicibacter pomeroyi 



<220> 

<221> misc_feature 

<222> (31).. (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (45).. (56) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (67) . . (70) 

<223> Xaa can be any naturally occurring amino acid 
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<220> 

<221> misc_feature 
<222> (85).. (96) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 
<222> (112) . . (119) 

<223> Xaa can be any naturally occurring amino acid 
<400> 93 

Cys Phe Ala Ala Gly Thr Arg lie Ala Thr Pro Lys Gly Ala Arg Pro 
15 10 15 



Val Glu Thr Leu Ala Val Gly Asp Leu Val Gin Thr Leu Asp Xaa Xaa 

20 25 30 



Xaa Gin Pro lie Arg Trp lie Gly Thr Arg Arg Val Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Val Val He Pro Ala His Ser 
50 55 60 



Phe Ala Xaa Xaa Xaa Xaa Thr His Pro Leu Leu Leu Ser Gin Gin His 
65 70 75 80 



Arg Val Leu Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

85 90 95 



Glu He Leu lie Ala Ala Arg Arg Leu Thr Gly Leu His Gly He Xaa 

100 105 110 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val Arg Tyr He His Phe Ala Leu Asp 
115 120 125 



Arg His Glu lie Val Phe Ala Asn Gly Leu Pro Ala Glu Thr Leu 
130 135 140 



<210> 94 

<211> 135 

<212> PRT 

<213> Silicibacter pomeroyi 



<220> 

<221> misc feature 

<222> (14)7.(14) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<22l> misc_feature 

<222> ' (32) . . (34) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (46).. (52) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (63).. (63) 
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<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 
<222> (78).. (89) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc__feature 
<222> (105) . . (Ill) 

<223> Xaa can be any naturally occurring amino acid 
<400> 94 

Ser Phe Thr Arg Gly Thr His He Thr Leu Gly Ser Gly Xaa Gin Val 
15 10 15 



Arg He Glu Asp Leu Lys Val Gly Asp Arg Val Leu Thr Arg Asp Xaa 

20 25 30 



Xaa Xaa Arg Glu Val Arg Trp He Gly Gin Thr Thr Val Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Pro He Val He Arg Ala Gly Thr Leu Asn Xaa Glu 
50 55 60 



Asn Asp Leu Val Val Ser Pro Asp His Arg Leu Phe Val Xaa Xaa Xaa 
65 70 75 80 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Leu Leu Leu Lys Ala Arg 

85 90 95 



His Leu Val Asn Gly Asp Thr Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val 

100 105 110 



Asp Tyr Phe Gin Leu Leu Phe Asp Arg His His He He Tyr Ala Glu 
115 120 125 



Gly He Ala Ala Glu Thr Met 
130 135 



<210> 95 

<211> 144 

<212> PRT 

<213> Silicibacter pomeroyi 



<220> 

<221> misc_f eature 

<222> (31).. (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> mis cofeature 

<222> (45).. (57) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (68) . . (71) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc feature 
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<222> (86).. (97) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 
<222> (113).. (120) 

<223> Xaa can be any naturally occurring amino acid 
<400> 95 

Ala Phe Ser Arg Gly Ser Leu He Asp Thr Asp Cys Gly Pro Met Ala 
15 10 15 



He Glu Asp Leu Leu Pro Gly Asp Arg Val He Thr Gin Asp Xaa Xaa 

20 25 30 



Xaa Gin Glu Val Val Trp Lys Gly Ser Thr Val He Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Thr Arg He Met Ala Asp 
50 55 60 



Ala Phe Gly Xaa Xaa Xaa Xaa Met Ser Gly Val He Ala Gly Pro Ser 
65 70 75 80 



Ala Arg Leu Leu Ala Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

85 90 95 



Xaa Pro Met Leu Thr Pro Val Gin His Phe Val Asp Gly Met Gly He 

100 105 110 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa He Glu Val Phe His He Cys Leu 
115 120 125 



Arg Arg His Ala Val He Asn Val Asp Gly Leu Gin Phe Glu Thr Tyr 
130 135 140 



<210> 96 

<211> 125 

<212> PRT 

<213> Silicibacter pomeroyi 



<220> 

<221> misc_feature 

<222> (31) . . (49) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (60).. (60) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (75).. (86) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (102) . . (108) 

<223> .Xaa can be any naturally occurring amino acid 

<400> 96 
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Gly Leu Pro Ala Gly Thr Met Leu Glu Thr Glu Ala Gly Trp Ser Pro 
1 5 10 ' 15 



Val Glu Glu lie Arg Pro Gly Thr Arg Val Ala Thr lie Asp Xaa Xaa 

20 25 30 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Leu Trp Arg lie Pro Gly Gly Thr Leu Gly Xaa Cys Ser Asp Leu 
50 55 60 



Leu Leu Pro Glu Gly His Phe Leu Ala Leu Xaa Xaa Xaa Xaa Xaa Xaa 
65 70 75 80 



Xaa Xaa Xaa Xaa Xaa Xaa Thr Val Leu Ala Pro Val Ala Ala Leu Ala 

85 90 95 



Gly Phe Glu Gly lie Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Pro Ala His 

100 105 110 



Ser Leu Arg Phe Ala Glu Glu Glu Val Val Trp Ala Gin 
115 120 125 



<210> 97 

<211> 126 

<212> PRT 

<213> Silicibacter pomeroyi 
<220> 

<221> misc^f eature 

<222> (31) . . (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (45) . . (47) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc__feature 

<222> (58).. (61) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (76) . . (87) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc__f eature 

<222> (103) . . (109) 

<223> Xaa can be any naturally occurring amino acid 

<400> 97 

Gly Phe Leu Ala Gly Thr lie Leu Leu Thr Gin Asp Gly Glu Met Pro 
3- 5 10 15 



Val Glu Phe Leu Ser Pro Gly Asp Arg lie lie Thr Arg Asp Xaa Xaa 

20 25 30 
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Xaa Val Pro Leu His His He Thr Arg Ala Pro Gin Xaa Xaa Xaa Ala 
35 40 45 



He Arg He Ala Ala Gly Ser Leu Gly Xaa Xaa Xaa Xaa Asp Cys Asp 
50 55 60 



Leu He Leu Pro Ala Gly Gin Pro Val Leu He Xaa Xaa Xaa Xaa Xaa 
65 70 75 80 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gin Ala Met Val Arg Ala Asp Ala Leu 

85 90 95 



Val Asp Gly Glu Phe He Xaa Xaa Xaa Xaa Xaa Xaa Xaa Met Gin Leu 

100 105 110 



Phe Gin Leu His Phe Asp Ser Ala His Val Leu Tyr Ala Gly 
115 120 125 



<210> 98 

<211> 132 

<212> PRT 

<213> Silicibacter pomeroyi 



<220> 

<221> misc_feature 

<222> (31).. (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (45).. (55) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_£eature 

<222> (66).. (66) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (81).. (92) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (108) . . (115) 

<223> Xaa can be any naturally occurring amino acid 

<400> 98 

Gly Leu Leu Ala Gly Thr Ser Val Ala Ser Asn Phe Gly Trp Gin Pro 
1 5 10 15 

Val Glu Ala Leu Lys Val Gly Asp Lys Val Leu Thr Phe Asp Xaa Xaa 

20 25 30 



Xaa Gin Thr Val Ala Asp He Gin Arg Glu Thr Val Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Val Arg Leu Pro Glu Gly Val Cys 
50 55 60 
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His Xaa Arg Arg Asp Leu Trp Met Met Pro Asp Gin Gly Leu Leu Val 
65 70 75 80 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Ala Val Val 

85 90 95 



Pro Ala Arg Met Leu Arg Gly Tyr Arg Gly lie Xaa Xaa Xaa Xaa Xaa 

100 105 110 



Xaa Xaa Xaa Val Glu Val Thr Thr Leu Ala Phe His Gin Asp Glu Val 
115 120 125 



He Tyr Val Glu 
130 



<210> 99 

<211> 134 

<212> PRT 

<213> Rhizobium leguminosarum 
<220> 

<221> misc_feature 

<222> (31).. (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc^feature 

<222> (45) . . (56) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc — feature 

<222> (67) . . (70) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (85) . . (85) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (101) . . (110) 

<223> Xaa can be any naturally occurring amino acid 

<400> 99 

Cys Phe Leu Arg Gly Thr Ala He Leu Thr Asp Cys Gly Glu Lys Pro 
15 10 15 

Val .Glu Asn Leu Ser He Gly Asp Arg 11 Val Ala Leu Pro Asp Xaa Xaa 

20 25 30 



Xaa Arg Pro He Lys Trp Val Gly Arg Gin Ser Phe Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro He Arg Val Ser Arg His Ala 
50 55 60 



Leu Asp Xaa Xaa Xaa Xaa His Ser Asp Leu Tyr Leu Ser Pro Gly His 
65 70 75 80 
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Ala Leu Tyr Leu Xaa Gly lie Leu lie Gin Val Lys Asp Leu Val Asn 

85 90 95 



Gly Lys Thr He Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa He Glu 

100 105 110 



Tyr Tyr Ala Val Met Leu Asp Thr His Glu Val He Leu Ala Gly Gly 
115 120 125 



Ala Glu Thr Glu Ser Phe 
130 



<210> 100 

<211> 138 

<212> PRT 

<213> Magnetospirillum magnetotacticum 
<220> 

<221> misc_feature 

<222> (31) . . (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (45).. (56) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (67).. (70) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (85).. (92) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_ feature 

<222> (108) . . (114) 

<223> Xaa can be any naturally occurring amino acid 

<400> 100 

Cys Tyr Val Thr Gly Thr Arg He Arg Thr Glu Arg Gly Glu He Ala 
15 10 15 

Val Glu Asp Leu Gin Val Gly Asp Phe Ala Val Thr Ala Ser Xaa Xaa 

20 25 30 



Xaa Arg Pro He Thr Trp He Gly His Arg Glu He Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Val Arg Val Arg Ala Gly Ala 
50 55 60 

Phe Gly Xaa Xaa Xaa Xaa Val Asn Asp Leu Phe Leu Ser Pro Gly His 
65 70 75 80 



Pro Val Leu Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Val Leu Val 

85 90 95 
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Pro Val Met; Cys Leu lie Asn Gly Thr Thr lie Xaa^Xaa Xaa Xaa Xaa 
^ 100 105 110 

Xaa Xaa Val Thr Tyr Trp His Val Glu Leu Asp Ala His Asp He Leu 
115 120 125 



Leu Ala Glu Gly Leu Pro Ala Glu Ser Tyr 
130 135 



<210> 101 

<211> 138 

<212> PRT 

<213> Magnetospirillum magnetotacticum 



<220> 

<221> misc_f eature 

<222> (31) . . (33) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (45).. (56) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (67).. (70) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (85) . . (92) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (108) . . (114) 

<223> Xaa can be any naturally occurring amino acid, 

<400> 1*0 

* ■ 

Cys Phe Val Ser Gly Thr Arg He Ser Val Glu Arg Gly Ser He Pro 
1 5 10 15 



Val Glu Leu Leu Arg He Gly Glu Lys Ala Arg Leu Ala Ser Xaa Xaa 

20 25 30 



Xaa Arg Thr He Thr Trp He Gly His Arg Glu He Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Val Arg Val Arg Ala Gly Ala 
50 55 60 



Phe Gly Xaa Xaa Xaa Xaa Ala Arg Asp Leu Phe Leu Ser Pro Gly His 
65 70 75 80 



Pro Val Leu He Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Val Leu Val 

85 90 95 



Pro val Met Cys Leu He Asn Gly Thr Ser He Xaa Xaa Xaa Xaa Xaa 

100 105 110 
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Xaa Xaa Val Thr Tyr Trp His Val Giu Leu Asp Arg His Asp lie Leu 
115 120 125 



Leu Ala Giu Gly Leu Pro Ala Giu Ser Tyr 
130 135 



<210> 102 

<211> 98 

<212> PRT 

<213> Magnetospirillum magnetotacticum 



<220> 

<221> misc^featur.e,, 

<222> (31).. (33)" 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (45) . . (56) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (67) . . (70) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (85).. (92) 

<223> Xaa can be any naturally occurring amino acid 

<400> 102 

Cys Phe Val Thr Gly Thr Met He Ala Thr Ala Arg Gly Giu Val Ala 
15 10 15 



Val Giu Asp Leu Arg Ala Gly Asp Phe Ala Arg Thr Ala Giu Xaa Xaa 

20 25 30 



Xaa Arg Pro He Val Trp He Gly His Arg Giu He Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Val Arg Val Arg Thr Gly Ala 

50 55 " 60 



Phe Gly Xaa Xaa Xaa Xaa Ala Arg Asp Leu Tyr Leu Ser Pro Gly His 
65 70 75 80 



Pro Val Leu Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Thr Leu Val 

85 90 95 



Pro He 



<210> 103 

<211> 24 

<212> PRT 

<213> Magnetospirillum magnetotacticum 



<400> 103 
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Val Thr Tyr Trp His Val Glu Leu Asp Ala His Asp lie Leu Leu Ala 
1 5 10 15 



Glu Gly Leu Pro Ala Glu Ser Tyr 

20 



<210> 
<211> 
<212> 
<213> 



104 
7 

PRT 



Magnetospirillum magnetotacticum 
<400> 104 

Arg Leu Pro Ala Glu Ser Tyr 
1 5 



<210> 105 

<211> 57 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Type B BIL domain consensus sequence 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



mis cofeature 
(1) : . (1) 
C, S, or P 



misc_feature 
(2) . . (2) 
F, L, or V 



misc_feature 
(3) . . (4) 

Xaa can be any naturally occurring amino acid 

misc_f eature 
(5) . . (5) 

G, S, W, D, or T 



misc_f eature 
(6) . . (6) 

T, E, S, A, or M 



misc__feature 
(7) . . <7) 

Xaa can be any naturally occurring amino acid 



<220> 

<221>-~ misc__feature 
<222> (8).. (8) 
<223> V, I, L, or M 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



misc_f eature 
(9) . . (9) 

Xaa can be any naturally occurring amino acid 



misc_f eature 

(10) . . (10) 

T, M, L, or K 
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<220> 

<221> misc_feature 

<222> (11) . . (12) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (13) . . (13) 

<223> G or N 



<220> 

<221> misc_feature 

<222> (14) . ; . (14) 

<223> 0-100 amino acids of any kind 



<220> 

<221> misc_feature 

<222> (15).. (15) 

<223> I, V, A, or L 



<220> 

<221> misc__ feature 

<222> (16) . . (16) 

<223> E, h, D, G, or S 



<220> 

<221> misc__f eature 

<222> (17) . . (17) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (18) . . (18) 

<223> I, L, V, or F 

<220> 

<221> misc_feature 

<222> (19) . . (20) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (22).. (22) 

<223> D, E, or M 



<220> 

<221> misc_feature 

<222> (23).. (23) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (24).. (24) 

<223> V or I 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 



misc_feature 
(25) . . (25) 

Xaa can be any naturally occurring amino acid 



misc_feature 
(26) . . (26) 
A, S, T, or G 



mis cofeature 
(27). . (27) 

0-100 amino acids of any kind 



misc_feature 
(28) . . (28) 
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<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
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L, I , F, or V 



misc_feature 
(29) . . (29) 

Xaa can be any naturally occurring amino acid 



misc_f eature 
(30) . . (30) 

S, T, V, A, V, or L 



misc_feature 
(31) . . (31) 
T, N, or S 



misc_feature 
(32) . . (33) 

Xaa can be any naturally occurring amino acid 



misc_f eature 
(35) . . (35) 
P, L, or R 



misc_f eature 
(36) . . (36) 
F, I, W, Y, or V 



misc_feature 
(37) . . (37) 

0-200 amino acids of any kind 



misc_f eature 
(38) . . (38) 
A, V, T, or M 



mis cofeature 
(39) . . (39) 
Y, F, H, or M 



misc_feature 
(40) . . (40) 
N, D/ R, or K 



misc^f eature 
(41) . . (41) 
L, F, I, or M 



misc_f eature 
(42) .. (42.) 

Xaa can be any naturally occurring amino acid 



misc_f eature 
(43) . . (43) 
V, I, or L 



misc^f eature 
(44) . . (46) 

Xaa can be any naturally occurring amino acid 



WO 2004/043997 



<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 
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misc_feature 
(47) .. (47) 
T, S, N, or E 



misc_feature 
(48) . . (48) 
T, S, N, or E 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



misc_feature 
(49) . . (49) 
Y, F,; or L 



misc_f eature 
(50) . . (50) 

Xaa can be any naturally occurring amino acid 



mi sc_f eature 
(51) (51) 
V, A, or I 



misc_feature 
(52) . . (52) 

0-100 amino acids of any kind 



mi sc_f eature 
(53) . . (53) 
W, L, V, or Y 



mi sc_f eature 
(54) . . (54) 
V, A, or S 



mis cofeature 
(56) . . <56) 
N or D 



<220> 
<221> 
<222> 
<223> 



misc_feature ; 
(57)..T57) 

Xaa can be any naturally occurring amino acid 



<400> 105 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
15 10 15 



Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

20 25 30 



Xaa His Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa His Xaa Xaa 
50 55 



<210> 106 

<211> 88 

<212> PRT 

<213> artificial sequence 



<220> 
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<223> Type B BIL domain consensus sequence 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



misc_f eature 
(1) . . (1) 
C, G, A or S 



misc_f eature 
(2) . . (2) 
F, L or Y 



mi sc_f eature 
(3) . . (4) 

Xaa can be any naturally occurring amino acid 



misc_f eature 
(5) . . (5) 
G, E, S or T 



mi sc_f eature 
(6) . . (6) 
T or S 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



mi sc_f eature 
(7) . . (7) 
Xaa can be any 



mi s c__f ea t u r e 
(8) (8) 
I, V or L 



naturally occurring amino acid 



jnisc_feature 
(9) . . (12) 

Xaa can be any naturally occurring amino acid 



misc_ f eature 
(14) . . (14) 
0-100 amino acids 



mi sc^JE eature 
(15) . . (15) 
V, I, A or L 



mi sc__f eature 
(16) . . (16) 
E, S, A or D 



misc__feature 
(17) : . (17) 
Xaa can be any 



of any kind 



naturally occurring amino acid 



misc feature 
(18)7. (18) 
L or I 



<220> 
<221> 
<222> 
<223> 



mi sc_f eature 
(19) f . (20) 
Xaa can be any 



naturally occurring amino acid 
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<220> 

<221> misc_feature 

<222> (21).. (21) 

<223> G, E or N 

<220> 

<221> misc__feature 

<222> (22) . . (22) 

<223> D, E/ A, Q or T 

<220> 

<221> misc_feature 

<222> (23) . ; . (23) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (24).. (24) 

<223> V, I, A, L or M 

<220> 

<221> misc_feature 

<222> (25).. (25) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> (26) . . (26) 

<223> T, V, L or A 

<220> 

<221> misc_feature 

<222> (27).. (27) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (28) . . (28) 

<223> D, S, E or T 

<220> 

<221> misc_feature 

<222> (29) . . (29) 

<223> 0-100 amino acids of any kind 
<220> 

<221> misc_feature 

<222> (30) . . (30) 

<223> P, L, A or M 

<220> 

<221> misc_feature 

<222> (31).. (31) 

<223> V, I , L, T or W 

<220> 

<221> jnisc_f eature 

<222> (32) . . (32) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> <33) ..* (33) 

<223> I, L, V, F or M 

<220> 

<221> misc_f eature 

<222> (34) . . (38) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_f eature 

<222> 139).. (39) 
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<223> A, D, G, H, K, N, Q or S 



<220> 

<221> misc_feature 

<222> (40) . . (40) 

<223> 0-100 amino acids of any kind 
<220> 

<221> misc_feature 

<222> (41) . . (41) 

<223> A, C, D, E, F, G, H, L, M, Q, R, S, T, V or Y 
<220> 

<221> misc_feature 

<222> (42).. (42) 

<223> A, C, E, G, H, K, L, N, Q, R, S or T 
<220> 

<221> mis cofeature 

<222> (43) . . (43) 

<223> A, D, E, G, N, P or T 

<220> 

<221> misc_feature 

<222> (44) . . (44) 

<223> I, h, M, R or V 

<220> 

<221> misc_feature 

<222> (45) . . (45) 

<223> F, I, L, M, Q, R, V, W or Y 



<220> 

<221> misc_feature 

<222> (46) . . (46) 

<223> A, h, M or V 



<220> 

<221> misc_f eature 

<222> (47).. (47) 

<223> A, G, L, M, P, S, or T 



<220> 

<221> misc_f eature 

<222> (48) . . (48) 

<223> A, E/ G, H, P, Q or S 

<220> 

<221> misc_feature 

<222> (49).. (49) 

<223> D, E, G, H, L, N, Q, R or S 



<220> 

<221> misc_feature 

<222> (50) . . (50) 

<223> A, H, Q or T 

<220> 

<221> misc_feature 

<222> (51) . . (51) 

<223> A, C, F, G, P, Q or R 
* 

<220> 

<221> misc__f eature 

<222> (52) . . (52) 

<223> F, I, L, M or V 

<220> 

<221> misc feature 

<222> (53)7. (53) 

<223> A, C, F r L, V or Y 



<220> 
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<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<22.2> 
<223> 

<220> 
<221> 
<222> 
<223> 



93 

misc_feature 
(54) . . (54) 

A, F, I, L, M, T, V or W 



mi sc_f eature 
(55) . . (55) 

0-100 amino acids of any kind 



mis cofeature 
(56) . . (56) 

A, D, ; E, G, L, P, Q, S, T or Y 



misc_feature 
(57) (57) 

A, F, I, K, L, M, R, T or V 



mis cofeature 

(58) . . (58) 

D, F, It, M or V 



misc_feature 
(59) . . (59) 
A, I, L, T or V 



misc_f eature 
(60) (60) 
A, K, M, P, Q or R 



mis cofeature 
(61) . . (61) 
A, I or V 



misc_feature 
(62) . . (62) 

A, C, D, E, G, H, I, K, L, M, Q, R, T or V 



mis cofeature 
(63) . . (63) 

A, C, D, F, H, M, R, T or V 



misc_f eature 
(64) . . (64) 
A, F, L or M 



misc_f eature 
(65) . . (65) 

A, G, I , L, R, T or V 



misc_f eature 
(66) . . (66) 
A, D, G, L, N or P 



mis cofeature 
(67) . . (67) 

A, D, F, G, H, K, L, N, P, Q, S, T or Y 



misc_feature 
(68) . . (68)" 

A, D, E, G., H, I , M, N, P, Q, R, S, T or K 
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<220> 

<221> misc_feature 

<222> (69) . . (69) 

<223> A, D, E, F, G, H, L, M, Q, R, S, T or W 



<220> 

<221> misc__f eature 

<222> 110) . . (70) 

<223> A, G, I, P, T or V 



<220> 

<221> mis cofeature 

<222> (71)7. (71) 

<223> 0-100 amino acids of any kind 
<220> 

<221> misc_feature 

<222> (72).. (72) 

<223> A, I, L, M, Q, T, V or Y 
<220> 

<221> misc_feature 

<222> (73).. (73) 

<223> D, E, K, L, P, Q, R, S, T or V 
<220> 

<221> misc__feature 

<222> (74) . . (74) 

<223> A, I, L, V, W or Y 

<220> 

<221> misc_feature 

<222> (75) . . (75) 

<223> A, F, H, I, I», M, T, V, W or Y 
<220> 

<221> misc_feature 

<222> (76).. (76) 

<223> A, G, H, N, Q, R, S or T 



<220> 

<221> misc^feature 

<222> (77) . . (77) 

<223> F, I, L, M or V 



<220> 

<221> misc_feature 

<222> (78).. (78) 

<223> A, C, E, H, L, 

<220> 

<221> misc_f eature 

<222> (79).. (79) 

<223> F, L, M or T 

<220> 

<221> misc__f eature 

<222> (80) . . {80) 

<223> A, D, E, H, P, 

. <220> 

<221> misc_feature 

<222> (81).. (81) 

<223> A, C, D, E y G, 



M, R, T or V 



Q or R 



H, N, Q, R, S or T 



<220> 

<221> misc^f eature 

<222> (82) . . (82) 

<223> A, D, E, H, N or P 

<220> 

<221> mlsc feature 
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<222> (83).. (83) 

<223> A, D, B, G, H, L, Q or S 

<220> 

<221> misc_feature 

<222> (84).. (84) 

<223> A, I, L, M, S or V 

<220> 

<221> misc_feature 

<222> (85) . . (85) 

<223> I, L, M or V 

<220> 

<221> mis cofeature 

<222> (86).. (86) 

<223> F, L, N, R, S, T, V, W or Y 
<220> 

<221> misc__feature 

<222> (87).. (87) 

<223> A, G, S, T or V 

<220> 

<221> misc_feature 

<222> (88) . . (88) 

<223> A, D, E, G, H, N, Q or S 

<400> 106 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa 
1 5 10 15 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

20 25 30 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
50 55 -60 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
65 70 75 80 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

85 



<210> 107 

<211> 149 

<212> PRT 

<213> Pseudomonas syringae 

<400> 107 

Cys Phe Ala Ala Gly Thr Met Val Ser Thr Pro Asp Gly Glu Arg Ala 
1 5 10 15 

* 

lie Asp Thr Leu Lys Val Gly Asp lie Val Trp Ser Lys Pro Glu Gly 

20 25 30 



•Gly Gly Lys Pro Phe Ala Ala Ala He Leu Ala Thr His He Arg Thr 
35 40 4 5 
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Asp Gin Pro lie Tyr Arg Leu Lys Leu Lys Gly Lys Gin Glu Asn Gly 
50 55 60 



Gin Ala Glu Asp Glu Ser Leu Leu Val Thr Pro Gly His Pro Phe Tyr 
65 70 75 80 

Val Pro Ala Gin His Gly Phe Val Pro Val lie Asp Leu Lys Pro Gly 

85 90 95 



Asp Arg Leu Gin Ser Leu Ala Asp Gly Ala Ser Glu Asn Thr Ser Ser 

100 105 110 

Glu Val Glu Ser Leu Glu Leu Tyr Leu Pro Val Gly Lys Thr Tyr Asn 
115 120 125 



Leu Thr Val Asp Val Gly His Thr Phe Tyr Val Gly Lys Leu Lys Thr 
130 135 140 



Trp Val His Asn Thr 
145 



<210> 108 
<211> 461 
<212> PRT 

<213> artificial sequence 
<220> 

<223> MBP-CBD splicing product 
<400> 108 

Met Lys Thr Glu Glu Gly Lys Leu Val lie Trp lie Asn Gly Asp Lys 
15 10 15 

Gly Tyr Asn Gly Leu Ala Glu Val Gly Lys Lys Phe Glu Lys Asp Thr 

20 25 30 

Gly lie Lys Val Thr Val Glu His Pro Asp Lys Leu Glu Glu Lys Phe 
35 40 45 

Pro Gin Val Ala Ala Thr Gly Asp Gly Pro Asp He He Phe Trp Ala 
50 55 60 

His Asp Arg Phe Gly Gly Tyr Ala Gin Ser Gly Leu Leu Ala Glu lie 
65 70 75 80 

Thr Pro Asp Lys Ala Phe Gin Asp Lys Leu Tyr Pro Phe Thr Trp Asp 

85 90 95 

Ala Val Arg Tyr Asn Gly Lys Leu He Ala Tyr Pro He Ala Val Glu 

1Q0 105 110 

Ala Leu Ser Leu He Tyr Asn Lys Asp Leu Leu Pro Asn Pro Pro Lys 
. H5 120 125 

4 

Thr Trp Glu Glu He Pro Ala Leu Asp Lys Glu Leu Lys Ala Lys Gly 
130 135 140 
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Lys Ser Ala Leu Met Phe Asn Leu Gin Glu Pro Tyr Phe Thr Trp Pro 
145 150 155 160 



Leu lie Ala Ala Asp Gly Gly Tyr Ala Phe Lys Tyr Glu Asn Gly Lys 

165 170 " 175 



Tyr Asp He Lys Asp Val Gly Val Asp Asn Ala Gly Ala Lys Phe Leu 

180 185 190 



Val Asp Leu He Ala Gly Leu Thr Lys Asn Lys His Met Asn Ala Asp 
195 200 205 



Thr Asp Tyr Ser He Ala Glu Ala Ala Phe Asn Lys Gly Glu Thr Ala 
210 215 220 



Met Thr He Asn Gly Pro Trp Ala Trp Ser Asn He Asp Thr Ser Lys 
225 230 235 * 240 



Val Asn Tyr Gly Val Thr Val Leu Pro Thr Phe Lys Gly Gin Pro Ser 

245 250 * " 255 



Lys Pro Phe Val Gly Val Leu Ser Ala Gly He Asn Ala Ala Ser Pro 

260 265 270 



Asn Lys Glu Leu Ala Lys Glu Phe Leu Glu Asn Tyr Leu Leu Thr Asp 
275 280 285 



Glu Gly Leu Glu Ala Val Asn Lys Asp Lys Pro Leu Gly Ala Val Ala 
290 295 * 300 



Leu Lys Ser Tyr Glu Glu Glu Leu Ala Lys Asp Pro Arg He Ala Ala 
305 310 315 320 



Thr Met Glu Asn Ala Gin Lys Gly Glu He Met Pro Asn lie Pro Gin 

325 330 335 



•Met Ser Ala Phe Trp Tyr Ala Val Arg Thr Ala Val He Asn Ala Ala 

340 345 350 



Ser Gly Arg Gin Thr Val Asp Glu Ala Leu Lys Asp Ala Gin Thr Asn 
355 360 365 



Ser Ser Ser Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn Leu Gly He 
370 375 380 



Glu Gly Arg He Ser Glu Phe Gly Ser Thr Ser Arg Val Asp Cys Gly 
385 390 395 400 



Gly Leu Thr Gly Leu Asn Ser Gly Leu Thr Thr Asn Pro Gly Val Ser 

405 410 415 



Ala Trp Gin Val Asn Thr Ala Tyr Thr Ala Gly Gin Leu Val Thr Tyr 

420 425 430 
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Asn Gly Lys Thr Tyr Lys Cys Leu Gin Pro His Thr Ser Leu Ala Gly 
435 440 445 



Trp Glu Pro Ser Asn Val Pro Ala Leu Trp Gin Leu Gin 
450 455 460 



<210> 109 

<211> 541 

<212> PRT 

<213> artificial sequence 

* 

<220> 

<223> MBP-PsyBIL carboxy terminal cleavage product 
<400> 109 

Met Lys Thr Glu- Glu Gly Lys Leu Val lie Trp He Asn Gly Asp Lys 
1 5 10 15 



Gly Tyr Asn Gly Leu Ala Glu Val Gly Lys Lys Phe Glu Lys Asp Thr 

20 25 30 



Gly He Lys Val Thr Val Glu His Pro Asp Lys Leu Glu Glu Lys Phe 
35 40 45 



Pro Gin Val Ala Ala Thr Gly Asp Gly Pro Asp He He Phe Trp Ala 
50 55 60 



His Asp Arg Phe Gly Gly Tyr Ala Gin Ser Gly Leu Leu Ala Glu He 
65 70 75 80 



Thr Pro Asp Lys Ala Phe Gin Asp Lys Leu Tyr Pro Phe Thr Trp Asp 

85 90 95 



Ala Val Arg Tyr Asn Gly Lys Leu He Ala Tyr Pro He Ala Val Glu 

100 105 110 



Ala Leu Ser Leu He Tyr Asn Lys Asp Leu Leu Pro Asn Pro Pro Lys 
115 120 125 



Thr Trp Glu Glu He Pro Ala Leu Asp Lys Glu Leu" Lys Ala Lys Gly 
130 135 140 



Lys Ser Ala Leu Met Phe Asn Leu Gin Glu Pro Tyr Phe Thr Trp Pro 
145 ISO 155 160 



Leu He Ala Ala Asp Gly Gly Tyr Ala Phe Lys Tyr Glu Asn Gly Lys 

165 . 170 175 



Tyr Asp lie Lys Asp Val Gly Val Asp Asn Ala Gly Ala Lys Ala Gly 

180 185 190 



Leu Thr Phe Leu Val Asp Leu He Lys Asn Lys His Met Asn Ala Asp 
195 200 205 



Thr Asp Tyr Ser He Ala Glu Ala Ala Phe Asn Lys Gly Glu Thr Ala 
210 215 220 
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Met Thr lie Asn Gly Pro Trp Ala Trp Ser Asn lie Asp Thr Ser Lys 
225 230 235 240 



Val Asn Tyr Gly Val Thr Val Leu Pro Thr Phe Lys Gly Gin Pro Ser 

245 250 ' 255 



Lys Pro Phe Val Gly Val Leu Ser Ala Gly lie Asn Ala Ala Ser Pro 

260 265 270 



Asn Lys Glu Leu Ala Lys Glu Phe Leu Glu Asn Tyr Leu Leu Thr Asp 
275 280 285 



Glu Gly Leu Glu Ala Val Asn Lys Asp Lys Pro Leu Gly Ala Val Ala 
290 295 300 



Leu Lys Ser Tyr Glu Glu Glu Leu Ala Lys Asp Pro Arg He Ala Ala 
305 310 315 " 320 



Thr Met Glu Asn Ala Gin Lys Gly Glu He Met Pro Asn He Pro Gin 

325 330 335 



Met Ser Ala Phe Trp Tyr Ala Val Arg Thr Ala Val He Asn Ala Ala 

340 345 350 



Ser Gly Arg Gin Thr Val Asp Glu Ala Leu Lys Asp Ala Gin Thr Asn 
355 360 365 



Ser Ser Ser Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn .Leu Gly He 
370 375 380 



Glu Gly Arg He Ser Glu Phe Gly Ser Cys Phe Ala Ala Gly Thr Met 
385 390 395 400 



Val Ser Thr Pro Asp <31y Glu Arg Ala He Asp Thr Leu Lys Val Gly 

405 410 415 



Asp He Val Trp Ser Lys Pro Glu Gly Gly Gly Lys Pro Phe Ala Ala 

420 425 " 430 



Ala He Leu Ala Thr His He Arg Thr Asp Gin Pro He Tyr Arg Leu 
435 440 445 



Lys Leu Lys Gly Lys Gin Glu Asn Gly Gin Ala Glu Asp Glu Ser Leu 
450 455 460 



Leu Val Thr Pro Gly His Pro Phe Tyr Val Pro Ala Gin His Gly Phe 
465 470 475 480 



Val Pro, Val He Asp Leu Lys Pro Gly Asp Arg Leu Gin Ser Leu Ala 

485 490 495 



Asp Gly Ala Ser Glu Asn Thr Ser Ser Glu Val Glu Ser Leu Glu Leu 

500 505 510 
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Tyr Leu Pro Val Gly Lys Thr Tyr Asn Leu Thr Val Asp Val Gly His 
515 ^ 520 525 



Thr Phe Tyr Val Gly Lys Leu Lys Thr Trp Val His Asn 
530 535 540 



